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O (54) Title: COMPOSITIONS AND METHODS FOR THE THERAPY AND DIAGNOSIS OF LUNG CANCER 

0 (57) Abstract: Compositions and methods for the therapy and diagnosis of cancer, particularly lung cancer, are disclosed. Diustra- 

Otive compositions comprise one or more lung tumor polypeptides, immunogenic portions thereof, polynucleotides that encode such 
polypeptides, anU'gcn presenting cell that expresses such polypeptides, and T cells that are specific for cells expressing such polypep- 
tides. The disclosed compositions are useful, for example, in the diagnosis, prevention and/or treatment of diseases, particularly lung 
^ cancer. 
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COMPOSITIONS AND METHODS FOR THE THERAPY 
AND DIAGNOSIS OF LUNG CANCER 



TECHNICAL FIELD OF THE INVENTION 
5 The present invention relates generally to therapy and diagnosis of 

cancer, such as lung cancer. The invention is more specifically related to polypeptides, 
comprising at least a portion of a lung tumor protein, and to polynucleotides encoding 
such polypeptides. Such polypeptides and polynucleotides are useful in pharmaceutical 
compositions, e.g., vaccines, and other compositions for the diagnosis and treatment of 
10 lung cancer. 

BACKGROUND OF THE INVENTION 

Cancer is a significant health problem throughout the world. Although 

advances have been made in detection and therapy of cancer, no vaccine or other 

universally successful method for prevention and/or treatment is currently available. 
15 Current therapies, which are generally based on a combination of chemotherapy or 

surgery and radiation, continue to prove inadequate in many patients. 

Lung cancer is the primary cause of cancer death among both men and 

women in the U.S., with an estimated 172,000 new cases being reported in 1994. The 

five-year survival rate among all lung cancer patients, regardless of the stage of disease 
20 at diagnosis, is only 13%. This contrasts with a five-year survival rate of 46% among 

cases detected while the disease is still localized. However, only 16% of lung cancers 

are discovered before the disease has spread. 

In spite of considerable research into therapies for these and other 

cancers, lung cancer remains difficult to diagnose and treat effectively. Accordingly, 
25 there is a need in the art for improved methods for detecting and treating such cancers. 

The present invention fulfills these needs and further provides other related advantages. 
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SUMMARY OF THE INVENTION 

In one aspect, the present invention provides polynucleotide 
compositions comprising a sequence selected from the group consisting of: 

(a) sequences provided in SEQ ID NOs:l-3, 6-8, 10-13, 15-27, 29, 
5 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86-96, 107- 
109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154, 157, 158, 
160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 213, 214, 
217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 424, 428, 
431, 434, 442, 447, 450 and 467; 

10 C>) complements of the sequences provided in SEQ ID NOs.T-3, 6-8, 

10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 
84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 
154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 
210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 

1 5 420, 424, 428, 43 1 , 434, 442, 447, 450 and 467; 

(c) sequences consisting of at least 10, 15, 20, 25, 30, 35, 40, 45, 50, 
75 and 100 contiguous residues of a sequence provided in SEQ ID NOs:l-3, 6-8, 10-13, 
15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86- 
96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154, 157, 

20 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 213, 
214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 424, 
428, 43 1 , 434, 442, 447, 450 and 467; 

v ( d ) sequences that hybridize to a sequence provided in SEQ ID 

NOs:l-3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 
25 77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 
148-151, 153, 154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 
198-207, 209, 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 
368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467, under moderate or 
highly stringent conditions; 
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(e) sequences having at least 75%, 80% 85%, 90%, 95%, 96%, 
97%, 98% or 99% identity to a sequence of SEQ ID NOs:l-3, 6-8, 10-13, 15-27, 29, 30, 
32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 
111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154, 157, 158, 160, 

5 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 213, 214, 217, 
220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 424, 428, 431, 
434, 442, 447, 450 and 467; and 

(f) degenerate variants of a sequence provided in SEQ ID NOs:l-3, 
6-8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80- 

10 82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 
154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207,209, 
210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 
420, 424, 428, 431, 434, 442, 447, 450 and 467. 

In one preferred embodiment, the polynucleotide compositions of the 

15 invention are expressed in at least about 20%, more preferably in at least about 30%, 
and most preferably in at least about 50% of lung tumors samples tested, at a level that 
is at least about 2-fold, preferably at least about 5-fold, and most preferably at least 
about 10-fold higher than that for normal tissues. 

The present invention, in another aspect, provides polypeptide 

20 compositions comprising an amino acid sequence that is encoded by a polynucleotide 
sequence described above. 

The present invention further provides polypeptide compositions 
comprising an amino acid sequence selected from the group consisting of sequences 
recited in SEQ IDNOs:152, 155, 156, 165, 166, 169, 170, 172, 174, 176,226-252, 338- 

25 344, 346, 350, 357, 361, 363, 365, 367, 369, 376-382, 387-419, 423, 427, 430, 433, 
441, 443, 446, 449 and 451-466, . 

In certain preferred embodiments, the polypeptides and/or 
polynucleotides of the present invention are immunogenic,, i.e., they are capable of 
eliciting an immune response, particularly a humoral and/or cellular immune response, 

30 as further described herein. 
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The present invention further provides fragments, variants and/or 
derivatives of the disclosed polypeptide and/or polynucleotide sequences, wherein the 
fragments, variants and/or derivatives preferably have a level of immunogenic activity 
of at least about 50%, preferably at least about 70% and more preferably at least about 
5 90% of the level of immunogenic activity of a polypeptide sequence set forth in SEQ ID 
NOs:152, 155, 156, 165, 166, 169, 170, 172, 174, 176, 226-252, 338-344, 346, 350, 
357, 361, 363, 365, 367, 369, 376-382, 387-419, 423, 427, 430, 433, 441, 443, 446, 449 
and 451-466, or a polypeptide sequence encoded by a polynucleotide sequence set forth 
in SEQ ID NOs:l~3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 
10 71, 73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 
142, 144, 148-151, 153, 154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 
193, 194, 198-207, 209, 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 
364, 365, 368, 370-375, 420, 424, 428^ 431, 434, 442, 447, 450 and 467. 

The present invention further provides polynucleotides that encode a 
15 polypeptide described above, expression vectors comprising such polynucleotides and 
host cells transformed or transfected with such expression vectors. 

Within other aspects, the present invention provides pharmaceutical 
compositions comprising a polypeptide or polynucleotide as described above and a 
physiologically acceptable carrier. 
20 , Within a related aspect of the present invention, the pharmaceutical 

compositions, e.g., vaccine compositions, are provided for prophylactic or therapeutic 
applications. Such compositions generally comprise an immunogenic polypeptide or 
polynucleotide of the invention and an immunostimulant, such as an adjuvant. 

The present invention further provides pharmaceutical compositions that 
25 comprise: (a) an antibody or antigen-binding fragment thereof that specifically binds to 
a polypeptide of the present invention, or a fragment thereof; and (b) a physiologically 
acceptable carrier. 

Within further aspects, the present invention provides pharmaceutical 
compositions comprising: (a) an antigen presenting cell that expresses a polypeptide as 
30 described above and (b) a pharmaceutical^ acceptable carrier or excipient Illustrative 
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antigen presenting cells include dendritic cells, macrophages, monocytes, fibroblasts 
and B cells. 

Within related aspects, pharmaceutical compositions are provided that 
comprise: (a) an antigen presenting cell that expresses a polypeptide as described above 
and (b) an immunostimulant. 

The present invention further provides, in other aspects, fusion proteins 
that comprise at least one polypeptide as described above, as well as polynucleotides 
encoding such fusion proteins, typically in the form of pharmaceutical compositions, 
e.g., vaccine compositions, comprising a physiologically acceptable carrier and/or an 
immunostimulant. The fusions proteins may comprise multiple immunogenic 
polypeptides or portions/variants thereof, as described herein, and may further comprise 
one or more polypeptide segments for facilitating the expression, purification and/or 
immunogenicity of the polypeptide(s). 

Within further aspects, the present invention provides methods for 
stimulating an immune response in a patient, preferably a T cell response in a human 
patient, comprising administering a pharmaceutical composition described herein. The 
patient may be afflicted with lung cancer, in which case the methods provide treatment 
for the disease, or patient considered at risk for such a disease may be treated 
prophylactically. 

Within further aspects, the present invention provides methods for 
inhibiting the development of a cancer in a patient, comprising administering to a 
patient a pharmaceutical composition as recited above. The patient may be afflicted 
with lung cancer, in which case the methods provide treatment for the disease, or patient 
considered at risk for such a disease may be treated prophylactically. 

The present invention further provides, within other aspects, methods for 
removing tumor cells from a biological sample, comprising contacting a biological 
sample with T cells that specifically react with a polypeptide of the present invention, 
wherein the step of contacting is performed under conditions and for a time sufficient to 
permit the removal of cells expressing the protein from the sample. 
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Within related aspects, methods are provided for inhibiting the 
development of a cancer in a patient, comprising administering to a patient a biological 
sample treated as described above. 

Methods are further provided, within other aspects, for stimulating 
5 and/or expanding T cells specific for a polypeptide of the present invention, comprising 
contacting T cells with one or more of: (i) a polypeptide as described above; (ii) a 
polynucleotide encoding such a polypeptide; and/or (iii) an antigen presenting cell that 
expresses such a polypeptide; under conditions and for a time sufficient to permit the 
stimulation and/or expansion of T cells. Isolated T cell populations comprising T cells 
10 prepared as described above are also provided. 

Within further aspects, the present invention provides methods for 
inhibiting the development of a cancer in a patient, comprising administering to a 
patient an effective amount of a T cell population as described above. 

The present invention further provides methods for inhibiting the 
15 development of a cancer in a patient, comprising the steps of: (a) incubating CD4 + 
and/or CD8 + T cells isolated from a patient with one or more of: (i) a polypeptide 
comprising at least an immunogenic portion of polypeptide disclosed herein; (ii) a 
polynucleotide encoding such a polypeptide; and (iii) an antigen-presenting cell that 
expressed such a polypeptide; and (b) administering to the patient an effective amount 
20 of the proliferated T cells, and thereby inhibiting the development of a cancer in the 
patient. Proliferated cells may, but need not, be cloned prior to administration to the 
patient. 

Within further aspects, the present invention provides methods for 
determining the presence or absence of a cancer, preferably a lung cancer, in a patient 

25 comprising: (a) contacting a biological sample obtained from a patient with a binding 
agent that binds to a polypeptide as recited above; (b) detecting in the sample an amount 
of polypeptide that binds to the binding agent; and (c) comparing the amount of 
polypeptide with a predetermined cut-off value, and therefrom determining the presence 
or absence of a cancer in the patient. Within preferred embodiments, the binding agent 

30 is an antibody, more preferably a monoclonal antibody. 
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The present invention also provides, within other aspects, methods for 
monitoring the progression of a cancer in a patient Such methods comprise the steps 
of: (a) contacting a biological sample obtained from a patient at a first point in time 
with a binding agent that binds to a polypeptide as recited above; (b) detecting in the 
5 sample an amount of polypeptide that binds to the binding agent; (c) repeating steps (a) 
and (b) using a biological sample obtained from the patient at a subsequent point in 
time; and (d) comparing the amount of polypeptide detected in step (c) with the amount 
detected in step (b) and therefrom monitoring the progression of the cancer in the 
patient. 

10 The present invention further provides, within other aspects, methods for 

detenniiiing the presence or absence of a cancer in a patient, comprising the steps of: (a) 
contacting a biological sample obtained from a patient with an oligonucleotide that 
hybridizes to a polynucleotide that encodes a polypeptide of the present invention; (b) 
detecting in the sample a level of a polynucleotide, preferably mKN A, that hybridizes to 

15 the oligonucleotide; and (c) comparing the level of polynucleotide that hybridizes to the 
oligonucleotide with a predetermined cut-off value, and therefrom determining the 
presence or absence of a cancer in the patient. Within certain embodiments, the amount 
of mKNA is detected via polymerase chain reaction using, for example, at least one 
oligonucleotide primer that hybridizes to a polynucleotide encoding a polypeptide as 

20 recited above, or a complement of such a polynucleotide. Within other embodiments, 
the amount of mRNA is detected using a hybridization technique, employing an 
oligonucleotide probe that hybridizes to a polynucleotide that encodes a polypeptide as 
recited above, or a complement of such a polynucleotide. 

In related aspects, methods are provided for monitoring the progression 

25 of a cancer in a patient, comprising the steps of: (a) contacting a biological sample 
obtained from a patient with an oligonucleotide that hybridizes to a polynucleotide that 
encodes a polypeptide of the present invention; (b) detecting in the sample an amount of 
a polynucleotide that hybridizes to the oligonucleotide; (c) repeating steps (a) and (b) 
using a biological sample obtained from the patient at a subsequent point in time; and 

30 (d) comparing the amount of polynucleotide detected in step (c) with the amount 
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detected in step (b) and therefrom monitoring the progression of the cancer in the 
patient. 

Within further aspects, the present invention provides antibodies, such as 
monoclonal antibodies, that bind to a polypeptide as described above, as well as 
5 diagnostic kits comprising such antibodies. Diagnostic kits comprising one or more 
oligonucleotide probes or primers as described above are also provided. 

These and other aspects of the present invention will become apparent 
upon reference to the following detailed description and attached drawings. All 
references disclosed herein are hereby incorporated by reference in their entirety as if 
1 0 each was incorporated individually. 

SEQUENCE IDENTIFIERS 

SEQ ID NO.l is the determined cDNA sequence for LST-S1-2 

SEQ ID NO:2 is the determined cDNA sequence for LST-S1-28 

SEQ ID NO:3 is the determined cDNA sequence for LST-S 1-90 
15 SEQ ED NO:4 is the determined cDNA sequence for LST-S1-144 

SEQ ID NO:5 is the determined cDNA sequence for LST-S1-133 

SEQ ED NO:6 is the determined cDNA sequence for LST-S1-169 

SEQ ID NO:7 is the determined cDNA sequence for LST-S2-6 

SEQ ID NO:8 is the determined cDNA sequence for LST-S2-1 1 
20 SEQ ID NO:9 is the determined cDNA sequence for LST-S2-1 7 

SEQ ID NO: 10 is the determined cDNA sequence for LST-S2-25 

SEQ ED NO: 1 1 is the determined cDNA sequence for LST-S2-39 

SEQ ED NO: 12 is a first determined cDNA sequence for LST-S2-43 

SEQ ED NO: 13 is a second determined cDNA sequence for LST-S2-43 
25 SEQ ID NO: 14 is the determined cDNA sequence for LST-S2-65 

SEQ ED NO: 15 is the determined cDNA sequence for LST-S2-68 

SEQ ED NO: 16 is the determined cDNA sequence for LST-S2-72 

SEQ ED NO: 17 is the determined cDNA sequence for LST-S2-74 

SEQ ID NO: 18 is the determined cDNA sequence for LST-S2-103 
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SEQH)NO:19 
SEQIDNO:20 
SEQIDNO:21 
SEQIDNO:22 
5 SEQIDNO:23 
SEQIDNO:24 
SEQIDNO:25 
SEQIDNO.26 
SEQIDNO:27 

10 SEQIDNO:28 
SEQIDNO:29 
SEQIDNO:30 
SEQIDNO:31 
SEQIDNO:32 

15 SEQEDNO:33 
SEQIDNO.34 
SEQIDNO:35 
SEQIDNO:36 
SEQIDNO:37 

20 SEQIDNO:38 
SEQIDNO:39 
SEQIDNO:40 
SEQIDNO:41 
SEQIDNO:42 

25 SEQIDNO:43 
SEQIDNO:44 
SEQIDNO:45 
SEQIDNO:46 
SEQIDNO:47 

30 SEQIDNO-.48 



s the determined cDNA sequence for LST-S2-N1-1F 
is the determined cDNA sequence for LST-S2-N1-2A 
s the determined cDNA sequence for LST-S2-N1-4H 
s the determined cDNA sequence for LST-S2-N1-5A 
s the determined cDNA sequence for LST-S2-N1-6B 
s the determined cDNA sequence for LST-S2-N1-7B 
s the determined cDNA sequence for LST-S2-N1-7H 
s me detennined cDNA sequence for LST-S2-N1-8A 
is the detennined cDNA sequence for LST-S2-N1-8D 
s the detennined cDNA sequence for LST-S2-N1-9A 
s the determined cDNA sequence for LST-S2-N1-9E 
s the detennined cDNA sequence for LST-S2-N1-10A 
s the detennined cDNA sequence for LST-S2-N1-10G 
s the determined cDNA sequence for LST-S2-N1-1 1 A 
is the determined cDNA sequence for LST-S2-N1-12C 
s the determined cDNA sequence for LST-S2-N1-12E 
is the detennined cDNA sequence for LST-S2-B1-3D 
s the detennined cDNA sequence for LST-S2-B1-6C 
s the determined cDNA sequence for LST-S2-B1-5D 
s the determined cDNA sequence for LST-S2-B1-5F 
s the determined cDNA sequence for LST-S2-B1-6G 
s the determined cDNA sequence for LST-S2-B1-8A 
is the determined cDNA sequence for LST-S2-B1-8D 
s the determined cDNA sequence for LST-S2-B1-10A 
s the determined cDNA sequence for LST-S2-B1-9B 
s die determined cDNA sequence for LST-S2-B1-9F 
s the determined cDNA sequence for LST-S2-B1-12D 
s the detennined cDNA sequence for LST-S2-I2-2B 
s the determined cDNA sequence for LST-S2-I2-5F 
s the determined cDNA sequence for LST-S2-I2-6B 
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SEQ ID NO:49 is the determined cDNA sequence for LST-S2-I2-7F 
SEQ ID NO:50 is the determined cDNA sequence for LST-S2-E-8G 
SEQ ID NO:5 1 is the determined cDNA sequence for LST-S2-I2-9E 
SEQ ID NO:52 is the determined cDNA sequence for LST-S2-I2-12B 
5 SEQ ID NO:53 is the determined cDNA sequence for LST-S2-H2-2C 
SEQ ID NO:54 is the determined cDNA sequence for LST-S2-H2-1G 
SEQ ID NO:55 is the determined cDNA sequence for LST-S2-H2-4G 
SEQ ID NO:56 is the determined cDNA sequence for LST-S2-H2-3H 
SEQ ID NO:57 is the determined cDNA sequence for LST-S2-H2-5G 
1 0 SEQ ID NO:58 is the determined cDNA sequence for LST-S2-H2-9B 
SEQ ID NO:59 is the determined cDNA sequence for LST-S2-H2-10H 
SEQ ID NO:60 is the determined cDNA sequence for LST-S2-H2-12D 
SEQ ID NO: 61 is the determined cDNA sequence for LST-S3-2 
SEQ ID NO: 62 is the determined cDNA sequence for LST-S3-4 

1 5 SEQ ID NO: 63 is the determined cDNA sequence for LST-S3-7 
SEQ ID NO: 64 is the determined cDNA sequence for LST-S3-8 
SEQ ID NO: 65 is the determined cDNA sequence for LST-S3-12 
SEQ ID NO: 66 is the determined cDNA sequence for LST-S3-13 
SEQ ID NO: 67 is the determined cDNA sequence for LST-S3-14 

20 SEQ ED NO: 68 is the determined cDNA sequence for LST-S3-1 6 
SEQ ID NO: 69 is the determined cDNA sequence for LST-S3-21 
SEQ ID NO: 70 is the determined cDNA sequence for LST-S3-22 
SEQ ID NO: 71 is the determined cDNA sequence for LST-S1-7 
SEQ ID NO: 72 is the determined cDNA sequence for LST-S1-A-1E 

25 SEQ ID NO: 73 is the determined cDNA sequence for LST-S 1 -A- 1 G 
SEQ ID NO: 74 is the determined cDNA sequence for LST-S 1-A-3E 
SEQ ID NO: 75 is the detennined cDNA sequence for LST-S1-A^E 
SEQ ID NO: 76 is the detennined cDNA sequence for LST-S1-A-6D 
SEQ ID NO: 77 is the detennined cDNA sequence for LST-S 1-A-8D 

30 SEQ ID NO: 78 is the determined cDNA sequence for LST-S1 -A-l OA 
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SEQ ID NO: 79 is the determined cDNA sequence for LST-S1-A-10C 
SEQ DD NO: 80 is the determined cDNA sequence for LST-S1-A-9D 
SEQ ID NO: 81 is the determined cDNA sequence for LST-S1-A-10D 
SEQ ID NO: 82 is the determined cDNA sequence for LST-S1-A-9H 

5 SEQ ID NO: 83 is the determined cDNA sequence for LST-S1-A-1 ID 
SEQ ID NO: 84 is the determined cDNA sequence for LST-S1-A-12D 
SEQ ID NO: 85 is the determined cDNA sequence for LST-S1-A-1 IE 
SEQ ID NO: 86 is the determined cDNA sequence for LST-S 1-A-l 2E 
SEQ ID NO: 87 is the determined cDNA sequence for L513S (T3). 

1 0 SEQ ID NO: 88 is the determined cDNA sequence for L5 13S contig 1 . 
SEQ ID NO: 89 is a first determined cDNA sequence for L514S. 
SEQ ED NO: 90 is a second determined cDNA sequence for L514S. 
SEQ ID NO: 91 is a first determined cDNA sequence for L516S. 
SEQ ID NO: 92 is a second determined cDNA sequence for L516S. 

15 SEQ ED NO: 93 is the determined cDNA sequence for L517S. 

SEQ ID NO: 94 is the extended cDNA sequence for LST-S1-169 (also known 
L519S). 

SEQ ED NO: 95 is a first determined cDNA sequence for L520S. 

SEQ ED NO: 96 is a second determined cDNA sequence for L520S. 
20 SEQ ID NO: 97 is a first determined cDNA sequence for L52 1 S. 

SEQ ED NO: 98 is a second determined cDNA sequence for L521S. 

SEQ ID NO: 99 is the determined cDNA sequence for L522S. 

SEQ ID NO: 1 00 is the determined cDNA sequence for L523S. 

SEQ ID NO: 1 0 1 is the determined cDNA sequence for L524S. 
25 SEQ ID NO: 1 02 is the determined cDNA sequence for L525S. 

SEQ ID NO: 103 is the determined cDNA sequence for L526S. 

SEQ ID NO: 104 is the determined cDNA sequence for L527S. 

SEQ ID NO: 1 05 is the determined cDNA sequence for L528S. 

SEQ ID NO: 106 is the determined cDNA sequence for L529S. 
30 SEQ ID NO: 1 07 is a first determined cDNA sequence for L530S. 
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SEQ ID NO: 108 is a second detennined cDNA sequence for L530S. 

SEQ ID NO: 1 09 is the detennined full-length cDNA sequence for L53 1 S short form 

SEQ ID NO: 1 10 is the amino acid sequence encoded by SEQ ID NO: 109. 

SEQ ID NO: 1 1 1 is the determined full-length cDNA sequence for L53 IS long form 

SEQ ID NO: 1 12 is the amino acid sequence encoded by SEQ ID NO: 111. 

SEQ ID NO: 1 13 is the determined full-length cDNA sequence for L520S. 

SEQ ID NO: 1 14 is the amino acid sequence encoded by SEQ ID NO: 1 13. 

SEQ ID NO: 1 1 5 is the determined cDNA sequence for contig 1 . 

SEQ ID NO: 1 16 is the determined cDNA sequence for contig 3. 

SEQ ID NO: 1 1 7 is the determined cDNA sequence for contig 4. 

SEQ ID NO: 1 18 is the detennined cDNA sequence for contig 5. 

SEQ ID NO: 1 19 is the determined cDNA sequence for contig 7. 

SEQ ID NO: 120 is the determined cDNA sequence for contig 8. 

SEQ ID NO: 121 is the determined cDNA sequence for contig 9. 

SEQ ID NO: 122 is the determined cDNA sequence for contig 10. 

SEQ ID NO: 123 is the determined cDNA sequence for contig 12. 

SEQ ID NO: 124 is the determined cDNA sequence for contig 1 1 . 

SEQ ID NO: 125 is the determined cDNA sequence for contig 13 (also known as 

L761P). 

SEQ ID NO: 126 is the determined cDNA sequence for contig 15. 
SEQ ID NO: 127 is the detennined cDNA sequence for contig 16. 
SEQ ID NO: 128 is the determined cDNA sequence for contig 17. 
SEQ ID NO: 1 29 is the determined cDNA sequence for contig 1 9. 
SEQ ID NO: 1 30 is the determined cDNA sequence for contig 20. 
SEQ ID NO: 1 3 1 is the determined cDNA sequence for contig 22. 
SEQ ID NO: 1 32 is the determined cDNA sequence for contig 24. 
SEQ ID NO: 133 is the determined cDNA sequence for contig 29. 
SEQ ID NO: 1 34 is the determined cDNA sequence for contig 3 1 . 
SEQ ID NO: 135 is the determined cDNA sequence for contig 33. 
SEQ ID NO: 1 36 is the determined cDNA sequence for contig 38. 
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SEQ ID NO: 137 is the detennined cDNA sequence for contig 39. 

SEQ ID NO: 138 is the determined cDNA sequence for contig 41. 

SEQ ID NO: 139 is the detennined cDNA sequence for contig 43. 

SEQ ID NO: 140 is the determined cDNA sequence for contig 44. 
5 SEQ ID NO: 14 1 is the determined cDNA sequence for contig 45. 

SEQ ID NO: 142 is the determined cDNA sequence for contig 47. 

SEQ ID NO: 143 is the determined cDNA sequence for contig 48. 

SEQ ID NO: 144 is the determined cDNA sequence for contig 49. 

SEQ ID NO: 145 is the determined cDNA sequence for contig 50. 
1 0 SEQ ID NO: 146 is the determined cDNA sequence for contig 53. 

SEQ ID NO: 147 is the detennined cDNA sequence for contig 54. 

SEQ ID NO: 148 is the determined cDNA sequence for contig 56. 
SEQ ID NO: 149 is the determined cDNA sequence for contig 57. 
SEQ ID NO: 1 50 is the determined cDNA sequence for contig 58. 
15 SEQ ID NO: 151 is the full-length cDNA sequence for L530S. 

SEQ ID NO: 1 52 is the amino acid sequence encoded by SEQ ID NO: 151 
SEQ ID NO: 153 is the full-length cDNA sequence of a first variant of L514S 
SEQ ID NO: 154 is the full-length cDNA sequence of a second variant of L514S 
SEQ ID NO: 1 55 is the amino acid sequence encoded by SEQ ID NO: 1 53. 
20 SEQ ID NO: 1 56 is the amino acid sequence encoded by SEQ ID NO: 1 54. 
SEQ ID NO: 1 57 is the determined cDNA sequence for contig 59. 
SEQ ID NO: 158 is the full-length cDNA sequence for L763P (also referred to as contig 
22). 

SEQ ID NO: 159 is the amino acid sequence encoded by SEQ ID NO: 158. 
25 SEQ ID NO: 160 is the full-length cDNA sequence for L762P (also referred to as contig 

17). 

SEQ ID NO: 1 61 is the amino acid sequence encoded by SEQ ID NO: 160. 
SEQ ID NO: 162 is the determined cDNA sequence for L515S. 
SEQ ID NO: 163 is the full-length cDNA sequence of a first variant of L524S. 
30 SEQ ID NO: 1 64 is the full-length cDNA sequence of a second variant of L524S. 
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SEQ ID NO: 165 is the amino acid sequence encoded by SEQ ID NO: 163. 
SEQ ID NO: 166 is the amino acid sequence encoded by SEQ ID NO: 164. 
SEQ ID NO: 1 67 is the full-length cDNA sequence of a first variant of L762P. 
SEQ ID NO: 168 is the full-length cDNA sequence of a second variant of L762P. 
5 SEQ ID NO: 169 is the amino acid sequence encoded by SEQ ID NO: 1 67. 
SEQ ID NO: 170 is the amino acid sequence encoded by SEQ ID NO: 168. 
SEQ ID NO: 171 is the full-length cDNA sequence for L773P (also referred to as contig 
56). 

SEQ ID NO: 172 is the amino acid sequence encoded by SEQ ID NO: 171. 
10 SEQ ID NO: 173 is an extended cDNA sequence for L519S. 

SEQ ID NO: 174 is the amino acid sequence encoded by SEQ ID NO: 174. 

SEQ ID NO: 175 is the full-length cDNA sequence for L523S. 

SEQ ID NO: 176 is the amino acid sequence encoded by SEQ ID NO: 175. 

SEQ ID NO: 177 is the determined cDNA sequence for LST-sub5-7A. 
1 5 SEQ ID NO: 1 78 is the determined cDNA sequence for LST-sub5-8G. 

SEQ ID NO: 1 79 is the determined cDNA sequence for LST-sub5-8H. 

SEQ ID NO: 180 is the determined cDNA sequence for LST-sub5-10B. 

SEQ ID NO: 1 81 is the determined cDNA sequence for LST-sub5-l OH. 

SEQ ID NO: 182 is the determined cDNA sequence for LST-sub5-12B. 
20 SEQ ID NO: 183 is the determined cDNA sequence for LST-sub5-l 1C. 

SEQ ID NO: 184 is the determined cDNA sequence forLST-sub6-lc. 

SEQ ID NO: 1 85 is the determined cDNA sequence for LST-sub6-2f. 

SEQ ID NO: 1 86 is the determined cDNA sequence for LST-sub6-2G. 

SEQ ID NO: 1 87 is the determined cDNA sequence for LST-sub6-4d. 
25 SEQ ID NO: 1 88 is the determined cDNA sequence for LST-sub6-4e. 

SEQ ID NO: 1 89 is the determined cDNA sequence for LST-sub6-4f. 

SEQ ID NO: 1 90 is the determined cDNA sequence for LST-sub6-3h. 

SEQ ID NO: 1 91 is the determined cDNA sequence for LST-sub6-5d. 

SEQ ID NO: 192 is the determined cDNA sequence for LST-sub6-5h. 
30 SEQ ID NO: 193 is the determined cDNA sequence for LST-sub6-6h. 
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SEQ ID NO: 194 is the determined cDNA sequence for LST-sub6-7a. 

SEQ ID NO: 195 is the determined cDNA sequence for LST-sub6-8a. 

SEQ ID NO: 196 is the determined cDNA sequence for LST-sub6-7d. 

SEQ ID NO: 197 is the determined cDNA sequence for LST-sub6-7e. 
5 SEQ ID NO: 198 is the determined cDNA sequence for LST-sub6-8e. 

SEQ ID NO: 199 is the determined cDNA sequence for LST-sub6-7g. 

SEQ ID NO: 200 is the determined cDNA sequence for LST-sub6-9f. 

SEQ ID NO: 201 is the determined cDNA sequence for LST-sub6-9h. 

SEQ ID NO: 202 is the determined cDNA sequence for LST-sub6-l lb. 
10 SEQ ID NO: 203 is the determined cDNA sequence for LST-sub6-l lc. 

SEQ ID NO: 204 is the determined cDNA sequence for LST-sub6-12c. 

SEQ ID NO: 205 is the determined cDNA sequence for LST-sub6-12e. 

SEQ ID NO: 206 is the determined cDNA sequence for LST-sub6-12f. 

SEQ ID NO: 207 is the determined cDNA sequence for LST-sub6-l lg. 
15 SEQ ID NO: 208 is the determined cDNA sequence for LST-sub6-12g. 

SEQ ID NO: 209 is the determined cDNA sequence for LST-sub6-12h. 

SEQ ID NO: 210 is the determined cDNA sequence for LST-sub6-IMa. 

SEQ ID NO: 21 1 is the determined cDNA sequence for LST-sub6-II-2b. 

SEQ ID NO: 212 is the determined cDNA sequence for LST-sub6-II-2g. 
20 SEQ ID NO: 213 is the determined cDNA sequence for LST-sub6-II-lhu 

SEQ ID NO: 214 is the determined cDNA sequence for LST-sub6-II-4a. 

SEQ ID NO: 21 5 is the determined cDNA sequence for LST-sub6-II-4b. 

SEQ ID NO: 216 is the determined cDNA sequence for LST-sub6-II-3e. 

SEQ ID NO: 217 is the determined cDNA sequence for LST-sub6-H-4f. 
25 SEQ ID NO: 218 is the determined cDNA sequence for LST-sub6-H-4g. 

SEQ ID NO: 219 is the determined cDNA sequence for LST-sub6-II-4h. 

SEQ ID NO: 220 is the determined cDNA sequence for LST-sub6-II-5c. 

SEQ ID NO: 221 is the determined cDNA sequence for LST-sub6-II-5e. 

SEQ ID NO: 222 is the determined cDNA sequence for LST-sub6-II-6f. 
30 SEQ ID NO: 223 is the determined cDNA sequence for LST-sub6-H-5g. 
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SEQ ID NO: 224 is the determined cDNA sequence for LST-sub6-II-6g. 

SEQ ID NO: 225 is the amino acid sequence for L528S. 

SEQ ID NO: 226-25 1 are synthetic peptides derived from L762P. 

SEQ ID NO: 252 is the expressed amino acid sequence of L514S. 
5 SEQ ID NO: 253 is the DNA sequence corresponding to SEQ ID NO: 252. 

SEQ ID NO: 254 is the DNA sequence of a L762P expression construct. 

SEQ ID NO: 255 is the determined cDNA sequence for clone 23785. 

SEQ ID NO: 256 is the detennined cDNA sequence for clone 23786. 

SEQ ID NO: 257 is the determined cDNA sequence for clone 23788. 
1 0 SEQ ID NO: 258 is the detennined cDNA sequence for clone 23790. 

SEQ ID NO: 259 is the determined cDNA sequence for clone 23793. 

SEQ ID NO: 260 is the determined cDNA sequence for clone 23794. 

SEQ ID NO: 261 is the determined cDNA sequence for clone 23795. 

SEQ ID NO: 262 is the, determined cDNA sequence for clone 23796. 
1 5 SEQ ID NO: 263 is the determined cDNA sequence for clone 23797. 

SEQ ID NO: 264 is the determined cDNA sequence for clone 23798. 

SEQ ID NO: 265 is the determined cDNA sequence for clone 23799. 

SEQ ID NO: 266 is the determined cDNA sequence for clone 23800. 

SEQ ID NO: 267 is the determined cDNA sequence for clone 23802. 
20 SEQ ID NO: 268 is the determined cDNA sequence for clone 23803. 

SEQ ID NO: 269 is the determined cDNA sequence for clone 23804. 

SEQ ID NO: 270 is the determined cDNA sequence for clone 23805. 

SEQ ID NO: 271 is the determined cDNA sequence for clone 23806. 

SEQ ID NO: 272 is the determined cDNA sequence for clone 23807. 
25 SEQ ID NO: 273 is the determined cDNA sequence for clone 23808. 

SEQ ID NO: 274 is the determined cDNA sequence for clone 23809. 

SEQ ID NO: 275 is the determined cDNA sequence for clone 23810. 

SEQ ID NO: 276 is the determined cDNA sequence for clone 238 1 1 . 

SEQ ID NO: 277 is the determined cDNA sequence for clone 238 1 2. 
30 SEQ ID NO: 278 is the determined cDNA sequence for clone 23813. 
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SEQ ID IvfO: 279 is the determined cDNA sequence for clone 23815, 

SEQIDNO: 280 is the detennined cDNA sequence for clone 25298. 

SEQ ID NO: 281 is the determined cDNA sequence for clone 25299. 

SEQ ID NO: 282 is the determined cDNA sequence for clone 25300. 
5 SEQ ID NO: 283 is the detennined cDNA sequence for clone 25301 

SEQ ID NO: 284 is the determined cDNA sequence for clone 25304 

SEQ ID NO: 285 is the determined cDNA sequence for clone 25309. 

SEQ ID NO: 286 is the detennined cDNA sequence for clone 253 12. 

SEQ ID NO: 287 is the detennined cDNA sequence for clone 25317. 
10 SEQ ID NO:288 is the determined cDNA sequence for clone 25321 . 

SEQ ID NO:289 is the determined cDNA sequence for clone 25323. 

SEQ ID NO:290 is the determined cDNA sequence for clone 25327. 

SEQ ID NO:291 is the determined cDNA sequence for clone 25328. 

SEQ ID NO:292 is the determined cDNA sequence for clone 25332. 
1 5 SEQ ID NO:293 is the determined cDNA sequence for clone 25333. 

SEQ ID NO:294 is the determined cDNA sequence for clone 25336. 

SEQ ID NO:295 is the determined cDNA sequence for clone 25340. 

SEQ ID NO:296 is the detennined cDNA sequence for clone 25342. 

SEQ ID NO:297 is the determined cDNA sequence for clone 25356. 
20 SEQ ID NO:298 is the determined cDNA seqdence for clone 25357. 

SEQ ID NO:299 is the determined cDNA sequence for clone 25361. 

SEQ ID NO:300 is the determined cDNA sequence for clone 25363. 

SEQ ID NO:301 is the determined cDNA sequence for clone 25397. 

SEQ ID NO:302 is the detennined cDNA sequence for clone 25402. 
25 SEQ ID NO:303 is the determined cDNA sequence for clone 25403. 

SEQ ID NO:304 is the determined cDNA sequence for clone 25405. 

SEQ IDNO:305 is the detennined cDNA sequence for clone 25407. 

SEQ ID NO:306 is the determined cDNA sequence for clone 25409. 

SEQ ID NO:307 is the determined cDNA sequence for clone 25396. 
30 SEQ ID NO:308 is the determined cDNA sequence for clone 25414. 
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SEQ ID NO:309 is the determined cDNA sequence for clone 25410. 

SEQ ID NO:310 is the determined cDNA sequence for clone 25406. 

SEQ ID NO:31 1 is the determined cDNA sequence for clone 25306. 

SEQ ID NO:3 12 is the determined cDNA sequence for clone 25362. 
5 SEQ ID NO:3 1 3 is the determined cDNA sequence for clone 25360. 

SEQ ID NO:3 14 is the determined cDNA sequence for clone 25398. 

SEQ ID NO:3 15 is the determined cDNA sequence for clone 25355. 

SEQ ID NO:3 16 is the determined cDNA sequence for clone 25351 . 

SEQ ID NO:3 1 7 is the determined cDNA sequence for clone 2533 1 . 
10 SEQ ID NO:3 1 8 is the determined cDNA sequence for clone 25338. 

SEQ ID NO:3 19 is the determined cDNA sequence for clone 25335. 

SEQ ID NO:320 is the determined cDNA sequence for clone 25329. 

SEQ ID NO:321 is the determined cDNA sequence for clone 25324. 

SEQ ID NO:322 is the determined cDNA sequence for clone 25322. 
15 SEQ ID NO:323 is the determined cDNA sequence for clone 25319. 

SEQ ID NO:324 is the detennined cDNA sequence for clone 25316. 

SEQ ID NO:325 is the determined cDNA sequence for clone 2531 1. 

SEQ ID NO:326 is the determined cDNA sequence for clone 253 10. 

SEQ ID NO:327 is the determined cDNA sequence for clone 25302. 
20 SEQ ID NO:328 is the detennined cDNA sequence for clone 253 1 5. 

SEQ ID NO:329 is the determined cDNA sequence for clone 25308. 

SEQ ID NO:330 is the detennined cDNA sequence for clone 25303. 

SEQ ID NOs:33 1-337 are the cDNA sequences of isoforms of the p53 tumor suppressor 

homologue, p63 (also referred to as. L530S). 
25 SEQ ID NOs:338-344 are the amino acid sequences encoded by SEQ ID NOs:33 1-337, 
respectively . 

SEQ ID NO:345 is a second cDNA sequence for the antigen L763P. 

SEQ ID NO:346 is the amino acid sequence encoded by the sequence of SEQ ID NO: 

345. 

30 SEQ ID NO:347 is a detennined full-length cDNA sequence for L523S. 
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SEQ ID NO;348 is the amino acid sequence encoded by SEQ ED NO: 347. 
SEQ ID NO:349 is the cDNA sequence encoding the N-terminal portion of L773P. 
SEQ ID NO:350 is the amino acid sequence of the N-terminal portion of L773P. 
SEQ ID NO:351 is the DNA sequence for a fusion of Ral2 and the N-terminal portion 
5 ofL763P 

SEQ ID NO:352 is the amino acid sequence of the fusion of Ral2 and the N-terminal 
portion of L763P 

SEQ ID NO:353 is the DNA sequence for a fusion of Ral2 and the C-terminal portion 
ofL763P 

10 SEQ ID NO:354 is the amino acid sequence of the fusion of Ral2 and the C-terminal 
portion of L763P 
SEQ ID NO:355 is a primer. 
SEQ ID NO:356 is a primer. 

SEQ ID NO:357 is the protein sequence of expressed recombinant L762P. 
1 5 SEQ ID NO:358 is the DNA sequence of expressed recombinant L762P. 
SEQ ID NO:359 is a primer. 
SEQ ID NO:360 is a primer. 

SEQ ID NO:361 is the protein sequence of expressed recombinant L773P A. 

SEQ BDNO:362 is the DNA sequence of expressed recombinant L773P A. 
20 SEQ ID NO:363 is an epitope derived from clone L773P polypeptide. 

SEQ ID NO:364 is a polynucleotide encoding the polypeptide of SEQ ID NO:363. 

SEQ ID NO:365 is an epitope derived from clone L773P polypeptide. 

SEQ IDNO:366 is a polynucleotide encoding the polypeptide of SEQ ID NO:365. 

SEQ ID NO:367 is an epitope consisting of amino acids 571-590 of SEQ ID NO:161, 
25 clone L762P. 

SEQ ID NO:368 is the full-length DNA sequence for contig 13 (SEQ ID NO:125), also 
referred to as L761P. 

SEQ ID NO:369 is the protein sequence encoded by the DNA sequence of SEQ ID 
NO:368. 

30 SEQ ID NO:370 is an L762P DNA sequence from nucleotides 2071-2130. 
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SEQ ID NO.-371 is an L762P DNA sequence from nucleotides 1441-1500. 

SEQ ID NO:372 is an L762P DNA sequence from nucleotides 1936-1955. 

SEQ ID NO:373 is an L762P DNA sequence from nucleotides 2620-2679. 

SEQ ID NO:374 is an L762P DNA sequence from nucleotides 1801-1 860. 

SEQ ID NO:375 is an L762P DNA sequence from nucleotides 1531-1591. 

SEQ ID NO:376 is the amino acid sequence of the L762P peptide encoded by SEQ ID 

NO:373. 

SEQ ID NO:377 is the amino acid sequence of the L762P peptide encoded by SEQ ID 
NO:370. 

SEQ ID NO:378 is the amino acid sequence of the L762P peptide encoded by SEQ ED 
NO:372. 

SEQ ID NO:379 is the amino acid sequence of the L762P peptide encoded by SEQ ID 
NO:374. 

SEQ ID NO:380 is the amino acid sequence of the L762P peptide encoded by SEQ ID 
NO-.371. 

SEQ ID NO:381 is the amino acid sequence of the L762P peptide encoded by SEQ ID 
NO:375. 

SEQ ID NO:382 is the amino acid sequence of an epitope of L762P. 
SEQ ID NOs:383-386 are PCR primers. 

SEQ ID NOs:387-395 are the amino acid sequences of L773P peptides. 

SEQ ID NOs:396-419 are the amino acid sequences of L523S peptides. 

SEQ ID NO:420 is the determined cDNA sequence for clone #19014. 

SEQ ID NO:42 1 is the forward primer PDM-278 for the L514S-13160 coding region. 

SEQIDNO:422 is the reverse primer PDM-278 fortheL514S-13160 coding region. 

SEQ ID NO:423 is the amino acid sequence for the expressed recombinant L5 14S. 

SEQ ID NO:424 is the DNA coding sequence for the recombinant L514S. 

SEQ ID NO:425 is the forward primer PDM-414 for the L523S coding region. 

SEQ ID NO:426 is the reverse primer PDM-414 for the L523S coding region. 

SEQ ID NO:427 is the amino acid sequence for the expressed recombinant L523S. 

SEQ ID NO:428 is the DNA coding sequence for the recombinant L523S. 
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SEQ ID NO:429 is the reverse primer PDM-279 for the L762PA coding region. 

SEQ ID NO:430 is the amino acid sequence for the expressed recombinant L762PA. 

SEQ ID NO:43 1 is the DNA coding sequence for the recombinant L762PA. 

SEQ ID NO:432 is the reverse primer PDM-300 for the L773P coding region. 
5 SEQ ID NO:433 is the amino acid sequence of the expressed recombinant L773P. 

SEQ ID NO:434 is the DNA coding sequence for the recombinant L773P. 

SEQ ID NO:435 is the forward primer for TCR Valpha8. 

SEQ ID NO:436 is the reverse primer for TCR Valpha8. 

SEQ ID NO:437 is the forward primer for TCR Vbeta8. 
10 SEQ ID NO:438 is the reverse primer for TCR Vbeta8. 

SEQ ID NO:439 is the TCR Valpha DNA sequence of the TCR clone specific for the 

lung antigen L762P. 

SEQ ID NO:440 is the TCR Vbeta DNA sequence of the TCR clone specific for the 
lung antigen L762P. 
1 5 SEQ ID NO:441 is the amino acid sequence of L763 peptide #2684. 

SEQ ID NO:442 is the predicted fall-length cDNA for the cloned partial sequence of 
clone L529S (SEQ ID NO:106). 

SEQ ID NO:443 is the deduced amino acid sequence encoded by SEQ ID NO:442 
SEQ ID NO:444 is the forward primer PDM-734 for the coding region of clone L523S. 
20 SEQ ID NO:445 is the reverse primer PDM-735 for the coding region of clone L523S. 
SEQ ID NO:446 is the amino acid sequence for the expressed recombinant L523S. 
SEQ ID NO:447 is the DNA coding sequence for the recombinant L523S. 
SEQ ID NO:448 is another forward primer PDM-733 for the coding region of clone 
L523S. 

25 SEQ ID NO:449 is the amino acid sequence for a second expressed recombinant L523S. 
SEQ ID NO.450 is the DNA coding sequence for a second recombinant L523S. 
SEQ ID NO:451 corresponds to amino acids 86-110, an epitope of L514S-specific in 
the generation of antibodies. , 

SEQ ID NO:452 corresponds to amino acids 21-45, an epitope of L514S-specific in the 
30 generation of antibodies. 
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SEQ ID NO:453 corresponds to amino acids 121-135, an epitope of L514S-specific in 
the generation of antibodies. 

SEQ ID NO:454 corresponds to amino acids 440-460, an epitope of L523S-specific in 
the generation of antibodies. 
5 SEQ ID NO:455 corresponds to amino acids 156-175, an epitope of L523S-specific in 
the generation of antibodies. 

SEQ ID NO:456 corresponds to amino acids 326-345, an epitope of L523S-specific in 
the generation of antibodies. 

SEQ ID NO:457 corresponds to amino acids 40-59, an epitope of L523S-specific in the 
10 generation of antibodies. 

SEQ ID NO:458 corresponds to amino acids 80-99, an epitope of L523S-specific in the 
generation of antibodies. 

SEQ ID NO:459 corresponds to amino acids 160-179, an epitope of L523S-specific in 
the generation of antibodies. 
15 SEQ ID NO:460 coiTesponds to amino acids 180-199, an epitope of L523S-specific in 
the generation of antibodies. 

SEQ ID NO:461 corresponds to amino acids 320-339, an epitope of L523S-specific in 
the generation of antibodies. 

SEQ ID NO:462 corresponds to amino acids 340-359, an epitope of L523S-specific in 
20 the generation of antibodies. » 

SEQ ID NO:463 corresponds to amino acids 370-389, an epitope of L523S-specific in 
the generation of antibodies. 

SEQ ID NO:464 corresponds to amino acids 380-399, an epitope of L523S-specific in 
the generation of antibodies. 
25 SEQ ID NO:465 corresponds to amino acids 37-55, an epitope of L523S-recognized by 
the L523S-specific CTL line 6B1. 

SEQ ID NO:466 corresponds to amino acids 41-51, the mapped antigenic epitope of 
L523S-recognized by the L523S-specific CTL line 6B1 . 

SEQ ID NO:467 corresponds to the DNA sequence which encodes SEQ ID NO:466. 

30 
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DETAILED DESCRIPTION OF THE INVENTION 

The present invention is directed generally to compositions and their use 
in the therapy and diagnosis of cancer, particularly lung cancer. As described further 
5 below, illustrative compositions of the present invention include, but are not restricted 
to, polypeptides, particularly immunogenic polypeptides, polynucleotides encoding such 
polypeptides, antibodies and other binding agents, antigen presenting cells (APCs) and 
immune system cells {e.g., T cells). 

The practice of the present invention will employ, unless indicated' 
10 specifically to the contrary, conventional methods of virology, immunology, 
microbiology, molecular biology and recombinant DNA techniques within the skill of 
the art, many of which are described below for the purpose of illustration. Such 
techniques are explained folly in the literature. See, e.g., Sambrook, et al. Molecular 
Cloning: A Laboratory Manual (2nd Edition, 1989); Maniatis et al. Molecular Cloning: 
15 A Laboratory Manual (1982); DNA Cloning: A Practical Approach, vol. I & II (D. 
Glover, ed.); Oligonucleotide Synthesis (N. Gait, ed., 1984); Nucleic Acid 
Hybridization (B. Hames & S. Higgins, eds., 1985); Transcription and Translation (B. 
Hames & S. Higgins, eds., 1984); Animal Cell Culture (R. Freshney, ed., 1986); Perbal, 
A Practical Guide to Molecular Cloning (1984). 
20 All publications, patents and patent applications cited herein, whether 

supra or infra, are hereby incorporated by reference in their entirety. 

As used in this specification and the appended claims, the singular forms 
"a," "an M and Mthe " include plural references unless the content clearly dictates 
otherwise. 

25 Polypeptide Compositions 

As used herein, the term "polypeptide" " ist used in its conventional 
meaning, i.e., as a sequence of amino acids. The polypeptides are not limited to a 
specific length of the product; thus, peptides, oligopeptides, and proteins are included 
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within the definition of polypeptide, and such terms may be used interchangeably herein 
unless specifically indicated otherwise. This term also does not refer to or exclude post- 
expression modifications of the polypeptide, for example, glycosylations, acetylations, 
phosphorylations and the like, as well as other modifications known in the art, both 
5 naturally occurring and non-naturally occurring. A polypeptide may be an entire 
protein, or a subsequence thereof. Particular polypeptides of interest in the context of 
this invention are amino acid subsequences comprising epitopes, i.e., antigenic 
determinants substantially responsible for the immunogenic properties of a polypeptide 
and being capable of evoking an immune response. 
10 Particularly illustrative polypeptides of the present invention comprise 

those encoded by a polynucleotide sequence set forth in any one of SEQ ED NOs:l-3, 6- 
8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80- 
82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 
154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 
15 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 
420, 424, 428, 431, 434, 442, 447, 450 and 467, or a sequence that hybridizes under 
moderately stringent conditions, or, alternatively, under highly stringent conditions, to a 
polynucleotide sequence set forth in any one of SEQ ID NOs:l-3, 6-8, 10-13, 15-27, 29, 
30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86-96, 107- 
20 109, 111, 113, 125, 127, 128, 129, 131-133,' 142, 144, 148-151, 153, 154, 157, 158, 
160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 213, 214, 
217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 424, 428, 
431, 434, 442, 447, 450 and 467. Certain illustrative polypeptides of the invention 
comprise amino acid sequences as set forth in any one of SEQ ID NOs.i52, 155, 156, 
25 165, 166, 169, 170,. 172, 174, 176, 226-252, 338-344, 346, 350, 357, 361, 363, 365, 
367, 369, 376-382, 387-419, 423, 427, 430, 433, 441, 443, 446, 449 and 451-466. 

The polypeptides of the present invention are sometimes herein referred 
to as lung tumor proteins or lung tumor polypeptides, as t an indication that then- 
identification has been based at least in part upon their increased levels of expression in 
30 lung tumor samples. Thus, a "lung tumor polypeptide" or "lung tumor protein," refers 



24 



WO 02/00174 



PCTAJS01/21065 



generally to a polypeptide sequence of the present invention, or a polynucleotide 
sequence encoding such a polypeptide, that is expressed in a substantial proportion of 
lung tumor samples, for example preferably greater than about 20%, more preferably 
greater than about 30%, and most preferably greater than about 50% or more of lung 

5 tumor samples tested, at a level that is at least two fold, and preferably at least five fold, 
greater than the level of expression in normal tissues, as determined using a 
representative assay provided herein. A lung tumor polypeptide sequence of the 
invention, based upon its increased level of expression in tumor cells, has particular 
utility both as a diagnostic marker as well as a therapeutic target, as further described 

10 below. 

In certain preferred embodiments, the polypeptides of the invention are 
immunogenic, i.e. 9 they react detectably within an immunoassay (such as an ELISA or 
T-cell stimulation assay) with antisera and/or T-cells from a patient with lung cancer. 
Screening for immunogenic activity can be performed using techniques well known to 

1 5 the skilled artisan. For example, such screens can be performed using methods such as 
those described in Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring 
Harbor Laboratory, 1988. In one illustrative example, a polypeptide may be 
immobilized on a solid support and contacted with patient sera to allow binding of 
antibodies within the sera to the immobilized polypeptide. Unbound sera may then be 

20 removed and bound antibodies detected using! for example, 125 I-labeled Protein A. 

As would be recognized by the skilled artisan, immunogenic portions of 
the polypeptides disclosed herein are also encompassed by the present invention. An 
"immunogenic portion," as used herein, is a fragment of an immunogenic polypeptide 
of the invention that itself is immunologically reactive (ie., specifically binds) with the 

25 B-cells and/or T-cell surface antigen receptors that recognize the polypeptide. 
Immunogenic portions may generally be identified using well known techniques, such 
as those summarized in Paul, Fundamental Immunology, 3rd ed., 243-247 (Raven Press, 
1993) and references cited therein. Such techniques include screening polypeptides for 
the ability to react with antigen-specific antibodies, antisera and/or T-cell lines or 

30 clones. As used herein, antisera and antibodies are "antigen-specific" if they 
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specifically bind to an antigen {i.e., they react with the protein in an EUSA or other 
immunoassay, and do not react detectably with unrelated proteins). Such antisera and 
antibodies may be prepared as described herein, and using well-known techniques. 

In one preferred embodiment, an immunogenic portion of a polypeptide 
5 of the present invention is a portion that reacts with antisera and/or T-cells at a level that 
is not substantially less than the reactivity of the full-length polypeptide {e.g., in an 
ELISA and/or T-cell reactivity .assay). Preferably, the level of immunogenic activity of 
the immunogenic portion is at least about 50%, preferably at least about 70% and most 
preferably greater than about 90% of the immunogenicity for the full-length 
10 polypeptide. In some instances, preferred immunogenic portions will be identified that 
have a level of immunogenic activity greater than that of the corresponding full-length 
polypeptide, e.g., having greater than about 100% or 150% or more immunogenic 
activity. 

In certain other embodiments, illustrative immunogenic portions may 

15 include peptides in which an N-terminal leader sequence and/or transmembrane domain 
have been deleted. Other illustrative immunogenic portions will contain a small N- 
and/or C-terminal deletion {e.g., 1-30 amino acids, preferably 5-15 amino acids), 
relative to the mature protein. 

In another embodiment, a polypeptide composition of the invention may 

20 also comprise one or more polypeptides that &re immunologically reactive with T cells 
and/or antibodies generated against a polypeptide of the invention, particularly a 
polypeptide having an amino acid sequence disclosed herein, or to an immunogenic 
fragment or variant thereof. 

In another embodiment of the invention, polypeptides are provided that 

25 comprise one or more polypeptides that are capable of eliciting T cells and/or antibodies 
that are immunologically reactive with one or more polypeptides described herein, or 
one or more polypeptides encoded by contiguous nucleic acid sequences contained in 
the polynucleotide sequences disclosed herein, or immunogenic fragments or variants 
thereof, or to one or more nucleic acid sequences which hybridize to one or more of 

30 these sequences under conditions of moderate to high stringency. 
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The present invention, in another aspect, provides polypeptide fragments 
comprising at least about 5, 10, 15, 20, 25, 50, or 100 contiguous amino acids, or more, 
including all intermediate lengths, of a polypeptide compositions set forth herein, such 
as those set forth in SEQ ID NOs:152, 155, 156, 165, 166, 169, 170, 172, 174, 176, 

5 226-252, 338-344, 346, 350, 357, 361, 363, 365, 367, 369, 376-382 and 387-419, 441, 
443, 446, 449 and 451-466, or those encoded by a polynucleotide sequence set forth in a 
sequence of SEQ ID NOs:l-3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57- 
59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86-96/107-109, 111, 113, 125, 127, 128, 129, 
131-133, 142, 144, 148-151, 153, 154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 

10 188-191, 193, 194, 198-207, 209, 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 
358, 362, 364, 365, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467. 

In another aspect, the present invention provides variants of the 
polypeptide compositions described herein. Polypeptide variants generally 
encompassed by the present invention will typically exhibit at least about 70%, 75%, 

15 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% or more identity 
(determined as described below), along its length, to a polypeptide sequences set forth 
herein. 

In one preferred embodiment, the polypeptide fragments and variants 
provide by the present invention are immunologically reactive with an antibody and/or 
20 T-cell that reacts with a full-length polypeptide specifically set for the herein. 

In another preferred embodiment, the polypeptide fragments and variants 
provided by the present invention exhibit a level of immunogenic activity of at least 
about 50%, preferably at least about 70%, and most preferably at least about 90% or 
more of that exhibited by a full-length polypeptide sequence specifically set forth 
25 herein. 

A polypeptide 'Variant," as the term is used herein, is a polypeptide that 
typically differs from a polypeptide specifically disclosed herein in one or more 
substitutions, deletions, additions and/or insertions. Such variants may be naturally 
occurring or may be synthetically generated, for example, by modifying one or more of 
30 the above polypeptide sequences of the invention and evaluating their immunogenic 
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activity as described herein and/or using any of a number of techniques well known in 
the art 

For example, certain illustrative variants of the polypeptides of the 
invention include those in which one or more portions, such as an N-terminal leader 
sequence or transmembrane domain, have been removed. Other illustrative variants 
include variants in which a small portion (e.g., 1-30 amino acids, preferably 5-15 amino 
acids) has been removed from the N- and/or C-terminal of the mature protein. 

In many instances, a variant will contain conservative substitutions A 
"conservative substitution" is one in which an amino acid is substituted for another 
amino acid that has similar properties, such that one skilled in the art of peptide 
chemistry would expect the secondary structure and hydropathic nature of the 
polypeptide to be substantially unchanged. As described above, modifications may be 
made in the structure of the polynucleotides and polypeptides of the present invention 
and still obtain a functional molecule that encodes a variant or derivative polypeptide 
with desirable characteristics, e.g., with immunogenic characteristics. When it is 
desired to alter the amino acid sequence of a polypeptide to create an equivalent, or 
even an improved, immunogenic variant or portion of a polypeptide of the invention, 
one skilled in the art will typically change one or more of the codons of the encoding 
DNA sequence according to Table 1 . 

For example, certain amino a<Jids may be substituted for other amino 
acids in a protein structure without appreciable loss of interactive binding capacity with 
structures such as, for example, antigen-binding regions of antibodies or binding sites 
on substrate molecules. Since it is the interactive capacity and nature of a protein that 
defines that protein's biological functional activity, certain amino acid sequence 
substitutions can be made in a protein sequence, and, of course, its underlying DNA 
coding sequence, and nevertheless obtain a protein with like properties. It is thus 
contemplated that various changes may be made in the peptide sequences of the 
disclosed compositions, or corresponding DNA sequences which encode said peptides 
without appreciable loss of their biological utility or activity. 
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Table 1 



Amino Acids 






Codons 




Alanine 


Ala 


A 


GCA 


GCC 


GCG 


GCU 


Cysteine 


Cys 


C 


UGC 


UGU 






Aspartic acid 


Asp 


D 


GAC 


GAU 






Glutamic acid 


Glu 


E 


GAA 


GAG 






Phenylalanine 


Phe 


F 


UUC 


UUU 






Glycine 


Gly 


G 


GGA 


GGC 


GGG 


GGU 


Histidine 


His 


H 


CAC 


CAU 






Isoleucine 


lie 


I 


AUA 


AUC 


AUU 




Lysine 


Lys 


K 


AAA 


AAG 






Leucine 


Leu 


L 


UUA 


UUG 


CUA 


cue 


Methionine 


Met 


M 


AUG 








Asparagine 


Asn 


N 


AAC 


AAU 






Proline 


Pro 


P 


CCA 


CCC 


CCG 


ecu 


Glutamine 


Gin 


Q 


CAA 


CAG 






Arginine 


Arg 


R 


AGA 


AGG 


CGA 


CGC 


Serine 


Ser 


S 


AGC 


AGU 


UCA 


UCC 


Threonine 


Thr 


T 


ACA 


ACC 


ACG 


ACU 


Valine 


Val 


V 


GUA 


GUC 


GUG 


GUU 


Tryptophan 


Trp 


W 


UGG 








Tyrosine 


Tyr 


Y 


UAC 


UAU 







In making such changes, the hydropathic index of amino acids may be 
5 considered. The importance of the hydropathic amino acid index in conferring 
interactive biologic function on a protein is generally understood in the art (Kyte and 
Doolittle, 1982, incorporated herein by reference). It is accepted that the relative 
hydropathic character of the amino acid contributes to the secondary structure of the 
resultant protein, which in turn defines the interaction of the protein with other 
10 molecules, for example, enzymes, substrates, receptors, DNA, antibodies, antigens, and 
the like. Each amino acid has been assigned a hydropathic index on the basis of its 
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hydrophobic^ and charge characteristics (Kyte and Dootittle, 1982). These values are: 
isoleucine (+4.5); valine (+4.2); leucine (+3.8); phenylalanine (+2.8); cysteine/cystine 
(+2.5); methionine (+1 .9); alanine (+1 .8); glycine (-0.4); threonine (-0.7); serine (-0.8); 
tryptophan (-0.9); tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); 
5 glutamine (-3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5). 

It is known in the art that certain amino acids may be substituted by other 
amino acids having a similar hydropathic index or score and still result in a protein with 
similar biological activity, Le. still obtain a biological functionally equivalent protein. 
In making such changes, the substitution of amino acids whose hydropathic indices are 
10 within ±2 is preferred, those within ±1 are particularly preferred, and those within ±0.5 
are even more particularly preferred. It is also understood in the art that the substitution 
of like amino acids can be made effectively on the basis of hydrophilicity. U. S. Patent 
4,554,101 (specifically incorporated herein by reference in its entirety), states that the 
greatest local average hydrophilicity of a protein, as governed by the hydrophilicity of 
1 5 its adjacent amino acids, correlates with a biological property of the protein. 

As detailed in U. S. Patent 4,554,1 01, the following hydrophilicity values 
have been assigned to amino acid residues: arginine (+3.0); lysine (+3.0); aspartate 
(+3.0 ± 1); glutamate (+3.0 ± 1); serine (+0.3); asparagine (+0.2); glutamine (+0.2); 
glycine (0); threonine (-0.4); proline (-0.5 ± 1); alanine (-0.5); histidine (-0.5); cysteine 
20 (-1.0); methionine (-1.3); valine (-1.5); leucine (-1.8); isoleucine (-1.8); tyrosine (- 
2.3); phenylalanine (-2.5); tryptophan (-3.4). It is understood that an amino acid can be 
substituted for another having a similar hydrophilicity value and still obtain a 
biologically equivalent; and in particular, an immunologically equivalent protein. In 
such changes, the substitution of amino acids whose hydrophilicity values are within +2 
25 is preferred, those within ±1 are particularly preferred, and those within +0.5 are even 
more particularly preferred.. 

As outlined above, amino acid substitutions are generally therefore based 
on the relative similarity of the amino acid side-chain substituents, for example, their 
hydrophobicity, hydrophilicity, charge, size, and the like. Exemplary substitutions that 
30 take various of the foregoing characteristics into consideration are well known to those 
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of skill in the art and include: arginine and lysine; glutamate and aspartate; serine and 
threonine; glutamine and asparagine; and valine, leucine and isoleucine. 

In addition, any polynucleotide may be further modified to increase 
stability in vivo. Possible modifications include, but are not limited to, the addition of 
5 flanking sequences at the 5 1 and/or 3' ends; the use of phosphorothioate or 2' Omethyl 
rather than phosphodiesterase linkages in the backbone; and/or the inclusion of 
nontraditional bases such as inosine, queosine and wybutosine, as well as acetyl- 
methyl-, thio- and other modified forms of adenine, cytidine, guanine, thymine and 
uridine. 

10 Amino acid substitutions may further be made on the basis of similarity 

in polarity, charge, solubility, hydrophobicity, hydrophilicity and/or the amphipathic 
natuie of the residues. For example, negatively charged amino acids include aspartic 
acid and glutamic acid; positively charged amino acids include lysine and arginine; and 
amino acids with uncharged polar head groups having similar hydrophilicity values 

15 include leucine, isoleucine and valine; glycine and alanine; asparagine and glutamine; 
and serine, threonine, phenylalanine and tyrosine. Other groups of amino acids that may 
represent conservative changes include: (1) ala, pro, gly, glu, asp, gin, asn, ser, thr; 
(2) cys, ser, tyr, thr; (3) val, ile, leu, met, ala, phe; (4) lys, arg, his; and (5) phe, tyr, trp, 
his. A variant may also, or alternatively, contain non-conservative changes. In a 

20 preferred embodiment, variant polypeptides differ from a native sequence by 
substitution, deletion or addition of five amino acids or fewer. Variants may also (or 
alternatively) be modified by, for example,- the deletion or addition of amino acids that 
have minimal influence on the immunogenicity, secondary structure and hydropathic 
nature of the polypeptide. 

25 As noted above, polypeptides may comprise a signal (or leader) sequence 

at the N-terminal end of the protein, which co-translationally or post-translationally 
directs transfer of the protein. The polypeptide may also be conjugated to a linker or 
other sequence for ease of synthesis, purification or identification of the polypeptide 
(e.g., poly-His), or to enhance binding of the polypeptide to a solid support. For 

30 example, a polypeptide may be conjugated to an immunoglobulin Fc region. 
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When comparing polypeptide sequences, two sequences are said to be 
"identical" if the sequence of amino acids in the two sequences is the same when 
aligned for maximum correspondence, as described below. Comparisons between two 
sequences are typically performed by comparing the sequences over a comparison 
5 window to identify and compare local regions of sequence similarity. A "comparison 
window" as used herein, refers to a segment of at least about 20 contiguous positions, 
usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a 
reference sequence of the same number of contiguous positions after the two sequences 
are optimally aligned. 

1 0 Optimal alignment of sequences for comparison may be conducted using 

the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, 
Inc., Madison, WI), using default parameters. This program embodies several 
alignment schemes described in the following references: Dayhoff, M.O. (1978) A 
model of evolutionary change in proteins - Matrices for detecting distant relationships. 
15 In Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical 
Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) 
Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology 
vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Sharp, P.M. (1989) 
CABIOS 5:151-153; Myers, E.W. and Muller W. (1988) CABIOS 4:11-17; Robinson, 
20 E.D. (1971) Comb. Theor 77:105; Santou, Ni Nes, M. (1987) Mol Biol Evol 4:406- 
425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy - the Principles and 
Practice of Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, W.J. and 
Lipman, D.J. (1983) Proc. Natl. Acad, SclUSA 50:726-730. 

Alternatively, optimal alignment of sequences for comparison may be 
25 conducted by the local identity algorithm of Smith and Waterman (1981) Add APL. 
Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. 
Mol Biol 48:443, by the search for similarity methods of Pearson and Lipman (1988) 
Proc. Natl Acad. Set USA 85: 2444, by computerized implementations of these 
algorithms (GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics 
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Software^Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), 
or by inspection. 

One preferred example of algorithms that are suitable for determining 
percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 

5 algorithms, which are described in Altschul et al. (1977) NucL Acids Res. 25:3389-3402 
and Altschul et al. (1990)7. Mol Biol 215:403-410, respectively. BLAST and BLAST 
2.0 can be used, for example with the parameters described herein, to determine percent 
sequence identity for the polynucleotides and polypeptides of the invention. Software 
for performing BLAST analyses is publicly available through the National Center for 

10 Biotechnology Information. For amino acid sequences, a scoring matrix can be used to 
calculate the cumulative score. Extension of the word hits in each direction are halted 
when: the cumulative alignment score falls off by the quantity X from its maximum 
achieved value; the cumulative score goes to zero or below, due to the accumulation of 
one or more negative-scoring residue alignments; or the end of either sequence is 

15 reached. The BLAST algorithm parameters W, T and X determine the sensitivity and 
speed of the alignment 

In one preferred approach, the "percentage of sequence identity" is 
determined by comparing two optimally aligned sequences over a window of 
comparison of at least 20 positions, wherein the portion of the polypeptide sequence in 

20 the comparison window may comprise additions or deletions (i.e., gaps) of 20 percent 
or less, usually 5 to 15 percent, or 10 to 12 percent, as compared to the reference 
sequences (which does not comprise additions or deletions) for optimal alignment of the 
two sequences. The percentage is calculated by determining the number of positions at 
which the identical amino acid residue occurs in both sequences to yield the number of 

25 matched positions, dividing the number of matched positions by the total number of 
positions in the reference sequence (i.e., the window size) and multiplying the results by 
100 to yield the percentage of sequence identity. 

Within other illustrative embodiments, a polypeptide may be a fusion 
polypeptide that comprises multiple polypeptides as described herein, or that comprises 

30 at least one polypeptide as described herein and an unrelated sequence, such as a known 
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tumor protein. A fusion partner may, for example, assist in providing T helper epitopes 
(an immunological fusion partner), preferably T helper epitopes recognized by humans, 
or may assist in expressing the protein (an expression enhancer) at higher yields than the 
native recombinant protein. Certain preferred fusion partners are both immunological 
5 and expression enhancing fusion partners. Other fusion partners may be selected so as 
to increase the solubility of the polypeptide or to enable the polypeptide to be targeted to 
desired intracellular compartments. Still further fusion partners include affinity tags, 
which facilitate purification of the polypeptide. 

Fusion polypeptides may generally be prepared using standard 
10 techniques, including chemical conjugation. Preferably, a fusion polypeptide is 
expressed as a recombinant polypeptide, allowing the production of increased levels, 
relative to a non-fused polypeptide, in an expression system. Briefly, DNA sequences 
encoding the polypeptide components may be assembled separately, and ligated into an 
appropriate expression vector. The 3' end of the DNA sequence encoding one 

15 polypeptide component is ligated, with or without a peptide linker, to the 5* end of a 
DNA sequence encoding the second polypeptide component so that the reading frames 
of the sequences are in phase. This permits translation into a single fusion polypeptide 
that retains the biological activity of both component polypeptides. 

A peptide linker sequence may be employed to separate the first and 

20 second polypeptide components by a distance 'sufficient to ensure that each polypeptide 
folds into its secondary and tertiary structures. Such a peptide linker sequence is 
incorporated into the fusion polypeptide using standard techniques well known in the 
art. Suitable peptide linker sequences may be chosen based on the following factors: 
(1) their ability to adopt a flexible extended conformation; (2) their inability to adopt a 

25 secondary structure that could interact with functional epitopes on the first and second 
polypeptides; and (3) the lack of hydrophobic or charged residues that might react with 
the polypeptide functional epitopes. Preferred peptide linker sequences contain Gly, 
Asn and Ser residues. Other near neutral amino acids, such as Thr and Ala may also be 
used in the linker sequence. Amino acid sequences which may be usefully employed as 

30 linkers include those disclosed in Maratea et al., Gene 40:39-46, 1985; Murphy et al., 
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Proc. Natl. Acad. Sci. USA 53:8258-8262, 1986; U.S. Patent No. 4,935,233 and U.S. 
Patent No. 4,751,180. The linker sequence may generally be from 1 to about 50 amino 
acids in length. Linker sequences are not required when the first and second 
polypeptides have non-essential N-terminal amino acid regions that can be used to 

5 separate the functional domains and prevent steric interference. 

The ligated DNA sequences are operably linked to suitable 
transcriptional or translational regulatory elements. The regulatory elements 
responsible for expression of DNA are located only 5' to the DNA sequence encoding 
the first polypeptides. Similarly, stop codons required to end translation and 

10 transcription termination signals are only present 3' to the DNA sequence encoding the 

second polypeptide. 

The fusion polypeptide can comprise a polypeptide as described herein 
together with an unrelated immunogenic protein, such as an immunogenic protein 
capable of eliciting a recall response. Examples of such proteins include tetanus, 
15 tuberculosis and hepatitis proteins (see, for example, Stoute et al. New Engl. J. Med., 
336:86-91, 1997). 

In one preferred embodiment, the immunological fusion partner is 
derived from a Mycobacterium sp., such as a Mycobacterium tuberculosis-derived Ral2 
fragment. Ral2 compositions and methods for their use in enhancing the expression 

20 and/or immunogenicity of heterologous polynucleotide/polypeptide sequences is 
described in U.S. Patent Application 60/158,585, the disclosure of which is 
incorporated herein by reference in its entirety. Briefly, Ral2 refers to a polynucleotide 
region that is a subsequence of a Mycobacterium tuberculosis MTB32A nucleic acid. 
MTB32A is a serine protease of 32 KD molecular weight encoded by a gene in virulent 

25 and avirulent strains of M. tuberculosis. The nucleotide sequence and amino acid 
sequence of MTB32A have been described (for example, U.S. Patent Application 
60/158,585; see also, Skeiky et al., Infection and Immun. (1999) 67:3998-4007, 
incorporated herein by reference). C-terminal fragments of the MTB32A coding 
sequence express at high levels and remain as a soluble polypeptides throughout the 

30 purification process. Moreover, Ral2 may enhance the immunogenicity of heterologous 
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immunogenic polypeptides with which it is fused. One preferred Ral2 fusion 
polypeptide comprises a 14 KD C-terminal fragment corresponding to amino acid 
4 residues 192 to 323 of MTB32A. 

Other preferred Ral2 polynucleotides generally comprise at least about 
5 15 consecutive nucleotides, at least about 30 nucleotides, at least about 60 nucleotides, 
at least about 100 nucleotides, at least about 200 nucleotides, or at least about 300 
nucleotides that encode a portion of a Ral2 polypeptide. 

Ral2 polynucleotides may comprise a native sequence (i.e., an 
endogenous sequence that encodes a Ral2 polypeptide or a portion thereof) or may 
10 comprise a variant of such a sequence. Ral2 polynucleotide variants may contain one 
or more substitutions, additions, deletions and/or insertions such that the biological 
activity of the encoded fusion polypeptide is not substantially diminished, relative to a 
fusion polypeptide comprising a native Ral2 polypeptide. Variants preferably exhibit at 
least about 70% identity, more preferably at least about 80% identity and most 
1 5 preferably at least about 90% identity to a polynucleotide sequence that encodes a native 
Ral2 polypeptide or a portion thereof 

Within other preferred embodiments, an immunological fusion partner is 
derived from protein D, a surface protein of the gram-negative bacterium Haemophilus 
influenza B (WO 91/18926). Preferably, a protein D derivative comprises 
20 approximately the first third of the protein (fe.g., the first N-terminal 100-110 amino 
acids), and a protein D derivative may be lipidated: Within certain preferred 
embodiments, the first 109 residues of a Lipoprotein D fusion partner is included on the 
N-terminus to provide the polypeptide with additional exogenous T-cell epitopes and to 
increase the expression level in E. coli (thus functioning as an expression enhancer). 
25 The lipid tail ensures optimal presentation of the antigen to antigen presenting cells. 
Other fusion partners include the non-structural protein from influenzae virus, NS1 
(hemaglutinin). Typically, the N-terminal 81 amino acids are used, although different 
fragments that include T-helper epitopes may be used. 

In another embodiment, the immunological fusion partner is the protein 
30 known as LYTA, or a portion thereof (preferably a C-teiminal portion). LYTA is 
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derived from Streptococcus pneumoniae, which syntheses an N-aeetyl-L-alanine 
amidase known as amidase LYTA (encoded by the LytA gene; Gene 43:265-292, 1986). 
LYTA is an autolysin that specifically degrades certain bonds in the peptidoglycan 
backbone. The C-terminal domain of the LYTA protein is responsible for the affinity to 
5 the choline or to some choline analogues such as DEAE. This property has been 
exploited for the development of K coli C-LYTA expressing plasmids useful for 
expression of fusion proteins. Purification of hybrid proteins containing the C-LYTA 
fragment at the amino terminus has been described {see Biotechnology J0:795-798, 
1992). Within a preferred embodiment, a repeat portion of LYTA may be incorporated 
10 into a fusion polypeptide. A repeat portion is found in the C-terminal region starting at 
residue 178. A particularly preferred repeat portion incorporates residues 188-305. 

Yet another illustrative embodiment involves fusion polypeptides, and 
the polynucleotides encoding them, wherein the fusion partner comprises a targeting 
signal capable of directing a polypeptide to the endosomal/lysosomal compartment, as 
15 described in U.S. Patent No. 5,63334. An immunogenic polypeptide of the invention, 
when fused with this targeting signal, will associate more efficiently with MHC class II 
molecules and thereby provide enhanced in vivo stimulation of CD4 + T-cells specific 

for the polypeptide. 

Polypeptides of the invention are prepared using any of a variety of well 

20 known synthetic and/or recombinant techniques, the latter of which are further 
described below. Polypeptides, portions and other variants generally less than about 
150 amino acids can be generated by synthetic means, using techniques well known to 
those of ordinary skill in the art. In one Ulustrative example, such polypeptides are 
synthesized using any of the commercially available solid-phase techniques, such as the 

25 Merrifield solid-phase synthesis method, where amino acids are sequentially added to a 
growing amino acid chain. See Merrifield, J. Am. Chem. Soc. 552149-2146, 1963. 
Equipment for automated synthesis of polypeptides is commercially available from 
suppliers such as Perkin Elmer/Applied BioSystems Division (Foster City, CA), and 
may be operated according to the manufacturer's instructions. 
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In general, polypeptide compositions (including fusion polypeptides) of 
the invention are isolated. An "isolated" polypeptide is one that is removed from its 
original environment For example, a naturally-occurring protein or polypeptide is 
isolated if it is separated from some or all of the coexisting materials in the natural 
5 system. Preferably, such polypeptides are also purified, e.g., are at least about 90% 
pure, more preferably at least about 95% pure and most preferably at least about 99% 
pure. 

Polynucleotide Compositions 

The present invention, in other aspects, provides polynucleotide 

10 compositions. Hie terms "DNA" and "polynucleotide" are used essentially 
interchangeably herein to refer to a DNA molecule that has been isolated free of total 
genomic DNA of a particular species. "Isolated," as used herein, means that a 
polynucleotide is substantially away from other coding sequences, and that the DNA 
molecule does not contain large portions of unrelated coding DNA, such as large 

15 chromosomal fragments or other functional genes or polypeptide coding regions. Of 
course, this refers to the DNA molecule as originally isolated, and does not exclude 
genes or coding regions later added to the segment by the hand of man. 

As will be understood by those skilled in the art, the polynucleotide 
compositions of this invention can include genomic sequences, extra-genomic and 

20 plasmid-encoded sequences and smaller engineered gene segments that express, or may 
be adapted to express, proteins, polypeptides, peptides and the like. Such segments may 
be naturally isolated, or modified synthetically by the hand of man. 

As will be also recognized by the skilled artisan, polynucleotides of the 
invention may be single-stranded (coding or antisense) or double-stranded, and may be 

25 DNA (genomic, cDNA or synthetic) or RNA molecules, RNA molecules may include 
HnRNA molecules, which contain introns and correspond to a DNA molecule in a one- 
to-one manner, and mRNA molecules, which do not contain introns. Additional coding 
or non-coding sequences may, but need not, be present within a polynucleotide of the 
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present invention, and a polynucleotide may, but need not, be linked to other molecules 

and/or support materials. 

Polynucleotides may comprise a native sequence (i.e., an endogenous 
sequence that encodes a polypeptide/protein of the invention or a portion thereof) or 
5 may comprise a sequence that encodes a variant or derivative, preferably and 
immunogenic variant or derivative, of such a sequence. 

Therefore, according to another aspect of the present invention, 
polynucleotide compositions are provided that comprise some or all of a polynucleotide 
sequence set forth in any one ofSEQ ID NOs:l-3, 6-8, 10-13, 15-27,29,30,32,34-49, 
10 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 
125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154, 157, 158, 160, 167, 168, 
171, 179, 182, 184-186, 188-191, 193, 194, 198-207, 209, 210, 213, 214, 217, 220-224, 
253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 424, 428, 431, 434, 
442, 447, 450 and 467, complements of a polynucleotide sequence set form in any one 
15 of SEQ IDNOs:l-3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 
71, 73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 
142, 144, 148-151, 153, 154, 157, 158, 160, 167, 168, 171, 179,. 182, 184-186, 188-191, 
193, 194, 198-207, 209, 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 
364, 365, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467, and degenerate 
20 variants of a polynucleotide sequence set forth in any one of SEQ ID NOs:l-3, 6-8, 10- 
13, 15-27, 29, 30, 32, 34^9, 51, 52, 54, 55, 57-59, 61-69, 71, 73, 74, 77, 78, 80-82, 84, 
86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 144, 148-151, 153, 154, 
157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 194, 198-207,209,210, 
213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 365, 368, 370-375, 420, 
25 424, 428, 431, 434, 442, 447, 450 and 467. In certain preferred embodiments, the 
polynucleotide sequences set forth herein encode immunogenic polypeptides, as 
described above. 

In other related embodiments, the present invention provides 
polynucleotide variants having substantial identity to the sequences disclosed herein in 
30 SEQ ID NOs:l-3, 6-8, 10-13, 15-27, 29, 30, 32, 34-49, 51, 52, 54, 55, 57-59, 61-69, 71, 
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73, 74, 77, 78, 80-82, 84, 86-96, 107-109, 111, 113, 125, 127, 128, 129, 131-133, 142, 
144, 148-151, 153, 154, 157, 158, 160, 167, 168, 171, 179, 182, 184-186, 188-191, 193, 
194, 198-207, 209, 210, 213, 214, 217, 220-224, 253-337, 345, 347, 349, 358, 362, 364, 
365, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467, for example those 
5 comprising at least 70% sequence identity, preferably at least 75%, 80%, 85%, 90%, 
95%, 96%, 97%, 98%, or 99% or higher, sequence identity compared to a 
polynucleotide sequence of this invention using the methods described herein, (e.g. 9 
BLAST analysis using standard parameters, as described below). One skilled in this art 
will recognize that these values can be appropriately adjusted to determine 
10 corresponding identity of proteins encoded by two nucleotide sequences by taking into 
account codon degeneracy, amino acid similarity, reading frame positioning and the 
like. 

Typically, polynucleotide variants will contain one or more substitutions, 
additions, deletions and/or insertions, preferably such that the immunogenicity of the 

15 polypeptide encoded by the variant polynucleotide is not substantially diminished 
relative to a polypeptide encoded by a polynucleotide sequence specifically set forth 
herein). The term 'Variants" should also be understood to encompasses homologous 
genes of xenogenic origin. 

In additional embodiments, the present invention provides 

20 polynucleotide fragments comprising various lengths of contiguous stretches of 
sequence identical to or complementary to one or more of the sequences disclosed 
herein. For example, polynucleotides are provided by this invention that comprise at 
least about 10, 15, 20, 30, 40, 50, 75, 100, 150, 200, 300, 400, 500 or 1000 or more 
contiguous nucleotides of one or more of the sequences disclosed herein as well as all 

25 intermediate lengths there between. It will be readily understood that "intermediate 
lengths", in this context, means any length between the quoted values, such as 16, 17, 
18, 19, etc.; 21, 22, 23, etc.; 30, 31, 32, etc.; 50, 51, 52, 53, etc.; 100, 101, 102, 103, 
etc.; 150, 151, 152, 153, etc.; including all integers through 200-500; 500-1,000, and the 
like. 
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In another embodiment of the invention, polynucleotide compositions are 
provided that are capable of hybridizing under moderate to high stringency conditions to 
a polynucleotide sequence provided herein, or a fragment thereof, or a complementary 
sequence thereof. Hybridization techniques are well known in the art of molecular 
5 biology. For purposes of iUustration, suitable moderately stringent conditions for 
testing the hybridization of a polynucleotide of this invention with other polynucleotides 
include prewashing in a solution of 5 X SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0); 
hybridizing at 50°C-60°C, 5 X SSC, overnight; followed by washing twice at 65°C for 
20 minutes with each of 2X, 0.5X and 0.2X SSC containing 0.1% SDS. One skilled in 
10 the art will understand that the stringency of hybridization can be readily manipulated, 
such as by altering the salt content of the hybridization solution and/or the temperature 
at which the hybridization is performed. For example, in another embodiment, suitable 
highly stringent hybridization conditions include those described above, with the 
exception that the temperature of hybridization is increased, e.g., to 60-65°C or 65- 
15 70°C. 

In certain preferred embodiments, the polynucleotides described above, 
e.g., polynucleotide variants, fragments and hybridizing sequences, encode polypeptides 
mat are immunologically cross-reactive with a polypeptide sequence specifically set 
forth herein. In other preferred embodiments, such polynucleotides encode 
20 polypeptides that have a level of immunogenifc activity of at least about 50%, preferably 
at least about 70%, and more preferably at least about 90% of that for a polypeptide 
sequence specifically set forth herein. 

The polynucleotides of the present invention, or fragments thereof, 
regaraless of the length of the coding sequence itself, may be combined with other DNA 
25 sequences, such as promoters, polyadenylation signals, additional restriction enzyme 
sites, multiple cloning sites, other coding segments, and the like, such that their overall 
length may vary considerably. It is therefore, contemplated that a nucleic acid fragment 
of almost any length may be employed, with the total length preferably being limited by 
the ease of preparation and use in the intended recombinant DNA protocol. For 
30 example, illustrative polynucleotide segments with total lengths of about 10,000, about 
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5000, about 3000, about 2,000, about 1,000, about 500, about 200, about 100, about 50 
base pairs in length, and the like, (including all intermediate lengths) are contemplated 
to be useful in many implementations of this invention. 

When comparing polynucleotide sequences, two sequences are said to be 
5 "identical" if the sequence of nucleotides in the two sequences is the same when aligned 
for maximum correspondence, as described below. Comparisons between two 
sequences are typically performed by comparing the sequences over a comparison 
window to identify and compare local regions of sequence similarity. A "comparison 
window" as used herein, refers to a segment of at least about 20 contiguous positions, 
10 usually 30 to about 75, 40 to about 50, in which a sequence may be compared to a 
reference sequence of the same number of contiguous positions after the two sequences 
are optimally aligned. 

Optimal alignment of sequences for comparison may be conducted using 
the Megalign program in the Lasergene suite of bioinformatics software (DNASTAR, 

15 Inc., Madison, WI), using default parameters. This program embodies several 
alignment schemes described in the following references: Dayhoff, MO. (1978) A 
model of evolutionary change in proteins - Matrices for detecting distant relationships. 
In Dayhoff, M.O. (ed.) Atlas of Protein Sequence and Structure, National Biomedical 
Research Foundation, Washington DC Vol. 5, Suppl. 3, pp. 345-358; Hein J. (1990) 

20 Unified Approach to Alignment and Phylogenes pp. 626-645 Methods in Enzymology 
vol. 183, Academic Press, Inc., San Diego, CA; Higgins, D.G. and Shaip, P.M. (1989) 
CABIOS 5:151-153; Myers, E.W. and Muller W. (1988) CABIOS 4:11-17; Robinson, 
E.D. (1971) Comb. Theor i/:105; Santou, N. Nes, M. (1987) Mol Biol Evol 4:406- 
425; Sneath, P.H.A. and Sokal, R.R. (1973) Numerical Taxonomy - the Principles and 

25 Practice of Numerical Taxonomy, Freeman Press, San Francisco, CA; Wilbur, WJ. and 
Lipman, DJ. (1983) Proc. Natl. Acad, Sci. USA 50:726-730. 

Alternatively, optimal alignment of sequences for comparison may be 
conducted by the local identity algorithm of Smith and Waterman (1981) Add APL. 
Math 2:482, by the identity alignment algorithm of Needleman and Wunsch (1970) J. 

30 Mol. Biol 48:443, by the search for similarity methods of Pearson and Lipman (1988) 
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Proc. Natl. Acad. Sci. USA 85: 2444, by computerized implementations of these 
algorithms (GAP, BESTFTT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics 
Software Package, Genetics Computer Group (GCG), 575 Science Dr., Madison, WI), 
or by inspection. 

5 One preferred example of algorithms that are suitable for determining 

percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 
algorithms, which are described in Altschul et al. (1977) Nucl. Acids Res. 25:3389-3402 
and Altschul et al. (1990) J. Mol. Biol. 215:403-410, respectively. BLAST and BLAST 
2.0 can be used, for example with the parameters described herein, to determine percent 
10 sequence identity for the polynucleotides of the invention. Software for performing 
BLAST analyses is publicly avaUable through the National Center for Biotechnology 
Information. In one illustrative example, cumulative scores can be calculated using, for 
nucleotide sequences, the parameters M (reward score for a pair of matching residues; 
always >0) and N (penalty score for mismatching residues; always <0). Extension of 
15 the word hits in each direction are halted when: the cumulative alignment score falls off 
by the quantity X from its maximum achieved value; the cumulative score goes to zero 
or below, due to the accumulation of one or more negative-scoring residue alignments; 
or the end of either sequence is reached. The BLAST algorithm parameters W, T and X 
determine the sensitivity and speed of the alignment The BLASTN program (for 
20 nucleotide sequences) uses as defaults a wordlength (W) of 1 1, and expectation (E) of 
10, and the BLOSUM62 scoring matrix (see Henikoff and Henikoff (1989) Proc. Natl. 
Acad Sci. USA 89:10915) alignments, (B) of 50, expectation (E) of 10, M=5, N=-4 and 
a comparison of both strands. 

Preferably, the "pe rcenta S e of s^ 1 "* 5 identity" is determined by 
25 comparing two optimally aligned sequences over a window of comparison of at least 20 
positions, wherein the portion of the polynucleotide sequence in the comparison 
window may comprise additions or deletions (i.e., gaps) of 20 percent or less, usually 5 
to 15 percent, or 10 to 12 percent, as compared to Ihe reference sequences (which does 
not comprise additions or deletions) for optimal alignment of the two sequences. The 
30 percentage is calculated by determining the number of positions at which the identical 
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nucleic acid bases occurs in both sequences to yield the number of matched positions, 
dividing the number of matched positions by the total number of positions in the 
reference sequence (i. e., the window size) and multiplying the results by 1 00 to yield the 
percentage of sequence identity. 
5 It will be appreciated by those of ordinary skill in the art that, as a result 

of the degeneracy of the genetic code, there are many nucleotide sequences that encode 
a polypeptide as described herein. Some of these polynucleotides bear minimal 
homology to the nucleotide sequence of any native gene. Nonetheless, polynucleotides 
that vary due to differences in codon usage are specifically contemplated by the present 
10 invention. Further, alleles of the genes comprising the polynucleotide sequences 
provided herein are within the scope of the present invention. Alleles are endogenous 
genes that are altered as a result of one or more mutations, such as deletions, additions 
and/or substitutions of nucleotides. The resulting mRNA and protein may, but need not, 
have an altered structure or function. Alleles may be identified using standard 
1 5 techniques (such as hybridization, amplification and/or database sequence comparison). 

Therefore, in another embodiment of the invention, a mutagenesis 
approach, such as site-specific mutagenesis, is employed for the preparation of 
immunogenic variants and/or derivatives of the polypeptides described herein. By this 
approach, specific modifications in a polypeptide sequence can be made through 
20 mutagenesis of the underlying polynucleotides that encode them. These techniques 
provides a straightforward approach to prepare and test sequence variants, for example, 
incorporating one or more of the foregoing considerations, by introducing one or more 
nucleotide sequence changes into the polynucleotide. 

Site-specific mutagenesis allows the production of mutants through the 
25 use of specific oligonucleotide sequences which encode the DNA sequence of the 
desired mutation, as well as a sufficient number of adjacent nucleotides, to provide a 
primer sequence of sufficient size and sequence complexity to form a stable duplex on 
both sides of the deletion junction being traversed. Mutations may be employed in a 
selected polynucleotide sequence to improve, alter, decrease, modify, or otherwise 
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change u> properties of the polynucleotide itself, and/or alter the properties, activity, 
composition, stability, or primary sequence of the encoded polypeptide. 

" In certain embodiments of the present invention, the inventors 
contemplate the mutagenesis of the disclosed polynucleotide sequences to alter one or 
5 more properties of the encoded polypeptide, such as the immunogenicity of a 
polypeptide vaccine. The techniques of site-specific mutagenesis are well-known in the 
art, and are widely used to create variants of both polypeptides and polynucleotides. For 
example, site-specific mutagenesis is often used to alter a specific portion of a DNA 
molecule. In such embodiments, a primer comprising typically about 14 to about 25 
10 nucleotides or so in length is employed, with about 5 to about 10 residues on both sides 
of the junction of the sequence being altered. 

As will be appreciated by those of skill in the art, site-specific 
mutagenesis techniques have often employed a phage vector that exists in both a single 
stranded and double stranded form. Typical vectors useful in site-directed mutagenesis 
15 include vectors such as the MB phage. These phage are readily commercially-available 
and their use is generally well-known to those skilled in the art. Double-stranded 
plasmids are also routinely employed in site directed mutagenesis that eliminates the 
step of transferring the gene of .interest from a plasmid to a phage. 

In general, site-directed mutagenesis in accordance herewith is 
20 performed by first obtaining a single-stranded Vector or melting apart of two strands of a 
double-stranded vector that includes within its sequence a DNA sequence that encodes 
the desired peptide. An oligonucleotide primer bearing the desired mutated sequence is 
prepared, generally synthetically. This primer is then annealed with the single-stranded 
vector, and subjected to DNA polymerizing enzymes such as E coli polymerase I 
25 Klenow fragment, in order to complete the synthesis of the mutation-bearing strand. 
Thus, a heteroduplex is formed wherein one strand encodes the original non-mutated 
sequence and the second strand bears the desired mutation. This heteroduplex vector is 
then used to transform appropriate cells, such as E coli cells, and clones are selected 
which include recombinant vectors bearing the mutated sequence arrangement. 
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The preparation of sequence variants of the selected peptide-encoding 
DNA segments using site-directed mutagenesis provides a means of producing 
potentially useful species and is not meant to be limiting as there are other ways in 
which sequence variants of peptides and the DNA sequences encoding them may be 
5 obtained. For example, recombinant vectors encoding the desired peptide sequence 
may be treated with mutagenic agents, such as hydroxylamine, to obtain sequence 
variants. Specific details regarding these methods and protocols are found in the 
teachings of Maloy et aL, 1994; Segal, 1976; Prokop and Bajpai, 1991; Kuby, 1994; and 
Maniatis et aL, 1982, each incorporated herein by reference, for that purpose. 
10 As "sed herein, the term "oligonucleotide directed mutagenesis 

procedure" refers to template-dependent processes and vector-mediated propagation 
which result in an increase in the, concentration of a specific nucleic acid molecule 
relative to its initial concentration, or in an increase in the concentration of a detectable 
signal, such as amplification.. As used herein, the term "oligonucleotide directed 
15 mutagenesis procedure" is intended to refer to a process that involves the 
template-dependent extension of a primer molecule. The term template dependent 
process refers to nucleic acid synthesis of an RNA or a DNA molecule wherein the 
sequence of the newly synthesized strand of nucleic acid is dictated by the well-known 
rules of complementary base pairing (see, for example, Watson, 1987). Typically, 
20 vector mediated methodologies involve the introduction of the nucleic acid fragment 
into a DNA or RNA vector, the clonal amplification of the vector, and the recovery of 
the amplified nucleic acid fragment. Examples of such methodologies are provided by 
U. S. Patent No. 4,237,224, specifically incorporated herein by reference in its entirety. 

In another approach for the production of polypeptide variants of the 
25 present invention, recursive sequence recombination, as described in U.S. Patent No. 
5,837,458, may be employed. In this approach, iterative cycles of recombination and . 
screening or selection are performed to "evolve" individual polynucleotide variants of 
the invention having, for example, enhanced immunogenic activity. 

In other embodiments of the present invention, the polynucleotide 
30 sequences provided herein can be advantageously used as probes or primers for nucleic 
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acid hybridization. As such, it is contemplated lhat nucleic acid segments that comprise 
a sequence region of at least about 15 nucleotide long contiguous sequence that has the 
same sequence as, or is complementary to, a 15 nucleotide long contiguous sequence 
disclosed herein will find particular utility. Longer contiguous identical or 
5 complementary sequences, e.g., those of about 20, 30, 40, 50, 100, 200, 500, 1000 
(including all intermediate lengths) and even up to full length sequences will also be of 
use in certain embodiments. 

The ability of such nucleic acid probes to specifically hybridize to a 
sequence of interest will enable them to be of use in detecting the presence of 
10 complementary sequences in a given sample. However, other uses are also envisioned, 
such as the use of the sequence information for the preparation of mutant species 
primers, or primers for use in preparing other genetic constructions. 

Polynucleotide molecules having sequence regions consisting of 
contiguous nucleotide stretches of 10-14, 15-20, 30, 50, or even of 100-200 nucleotides 
15 or so (including intermediate lengths as well), identical or complementary to a 
polynucleotide sequence disclosed herein, are particularly contemplated as hybridization 
probes for use in, e.g., Southern and Northern blotting. This would allow a gene 
product, or fragment thereof, to be analyzed, both in diverse cell types and also in 
various bacterial cells. The total size of fragment, as well as the size of the 
20 complementary stretch(es), will ultimately depend on the intended use or application of 
the particular nucleic acid segment. Smaller fragments will generally find use in 
hybridization embodiments, wherein the length of the contiguous complementary region 
may be varied, such as between about 15 and about 100 nucleotides, but larger 
contiguous complementarity stretches may be used, according to the length 
25 complementary sequences one wishes to detect. 

The use of a hybridization probe of about 15-25 nucleotides in length 
allows the formation of a duplex molecule that is both stable and selective. Molecules 
having contiguous complementary sequences over stretches greater than 15 bases in 
length are generally preferred, though, in order to increase stability and selectivity of the 
30 hybrid, and thereby improve the quality and degree of specific hybrid molecules 
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obtained. One will generally prefer to design nucleic acid molecules having gene- 
complementary stretches of 15 to 25 contiguous nucleotides, or even longer where 
desired. 

Hybridization probes may be selected from any portion of any of the 
5 sequences disclosed herein. All that is required is to review the sequences set forth 
herein, or to any continuous portion of the sequences, from about 15-25 nucleotides in 
length up to and including the full length sequence, that one wishes to utilize as a probe 
or primer. The choice of probe and primer sequences may be governed by various 
factors. For example, one may wish to employ primers from towards the termini of the 
10 total sequence. 

Small polynucleotide segments or fragments may be readily prepared by, 
for example, directly synthesizing the fragment by chemical means, as is commonly 
practiced using an automated oligonucleotide synthesizer. Also, fragments may be 
obtained by application of nucleic acid reproduction technology, such as the PCR™ 
15 technology of U. S. Patent 4,683,202 (incorporated herein by reference), by introducing 
selected sequences into recombinant vectors for recombinant production, and by other 
recombinant DNA techniques generally known to those of skill in the art of molecular 
biology. 

The nucleotide sequences of the invention may be used for their ability to 
20 selectively form duplex molecules with complementary stretches of the entire gene or 
gene fragments of interest. Depending on the application envisioned, one will typically 
desire to employ varying conditions of hybridization to achieve varying degrees of 
selectivity of probe towards target sequence. For applications requiring high selectivity, 
one will typically desire to employ relatively stringent conditions to form the hybrids, 
25 e.g., one will select relatively low salt and/or high temperature conditions, such as 
provided by a salt concentration of from about 0.02 M to about 0.15 M salt at 
temperatures of from about 50°C to about 70°C. Such selective conditions tolerate 
litde, if any, mismatch between the probe and the template or target strand, and would 
be particularly suitable for isolating related sequences. 
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Of course, for some applications, for example, where one desires to 
prepare mutants employing a mutant primer strand hybridized to an underlying 
template, less stringent (reduced stringency) hybridization conditions will typically be 
needed in order to allow formation of the heteroduplex. In these circumstances, one 
5 may desire to employ salt conditions such as those of from about 0.15 M to about 0.9 M 
salt, at temperatures ranging from about 20°C to about 55°C. Cross-hybridizing species 
can thereby be readily identified as positively hybridizing signals with respect to control 
hybridizations. In any case, it is generally appreciated that conditions can be rendered 
more stringent by the addition of increasing amounts of formamide, which serves to 
10 destabilize the hybrid duplex in ihe same manner as increased temperature. Thus, 
hybridization conditions can be readily manipulated, and thus will generally be a 
method of choice depending on the desired results. 

According to another embodiment of the present invention, 
polynucleotide compositions comprising antisense oligonucleotides are provided. 
15 Antisense oligonucleotides have been demonstrated to be effective and targeted 
inhibitors of protein synthesis, and, consequently, provide a therapeutic approach by 
which a disease can be treated by inhibiting the synthesis of proteins that contribute to 
the disease. The efficacy of antisense oligonucleotides for inhibiting protein synthesis 
is well established. For example, the synthesis of polygalactauronase and the muscarine 
20 type 2 acetylcholine receptor are inhibited by antisense oligonucleotides directed to their 
respective raRNA sequences (U S. Patent 5,739,119 and U. S. Patent 5,759,829). 
Further, examples of antisense inhibition have been demonstrated with the nuclear 
protein cyclin, the multiple drug resistance gene (MDG1), ICAM-1, E-selectin, STK-1, 
striatal GABAa receptor and human EGF (Jaskulski et al, Science. 1988 Jun 
25 10;240(4858):1544-6; Vasanthakumar and Ahmed, Cancer Commun. 1989;1(4):225- 
32; Peris et al, Brain Res Mol Brain Res. 1998 Jun 15;57(2):3 10-20; U. S. Patent 
5,801,154; U.S. Patent 5,789,573; U. S. Patent 5,718,709 and U.S. Patent 5,610,288). 
Antisense constructs have also been described that inhibit and can be used to treat a 
variety of abnormal cellular proliferations, e.g. cancer (U. S. Patent 5,747,470; U. S.. 
30 Patent 5,591,317 and U. S. Patent 5,783,683). 
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Therefore, in certain embodiments, the present invention provides 
oligonucleotide sequences that comprise all, or a portion of, any sequence that is 
capable of specifically binding to polynucleotide sequence described herein, or a 
complement thereof. In one embodiment, the antisense oligonucleotides comprise DNA 
5 or derivatives thereof. In another embodiment, the oligonucleotides comprise RNA or 
derivatives thereof. In a third embodiment, the oligonucleotides are modified DNAs 
comprising a phosphorothioated modified backbone. In a fourth embodiment, the 
oligonucleotide sequences comprise peptide nucleic acids or derivatives thereof. In 
each case, preferred compositions comprise a sequence region that is complementary, 
10 and more preferably substantially-complementary, and even more preferably, 
completely complementary to one or more portions of polynucleotides disclosed herein. 
Selection of antisense compositions specific for a given gene sequence is based upon 
analysis of the chosen target sequence and determination of secondary structure, T m , 
binding energy, and relative stability. Antisense compositions may be selected based 
15 upon their relative inability to form dimers, hairpins, or other secondary structures that 
would reduce or prohibit specific binding to the target mRNA in a host cell. Highly 
preferred target regions of the mRNA, are those which are at or near the AUG 
translation initiation codon, and those sequences which are substantially complementary 
to 5' regions of the mRNA. These secondary structure analyses and target site selection 
20 considerations can be performed, for example; using v.4 of the OLIGO primer analysis 
software and/or the BLASTN 2.0.5 algorithm software (Altschul et al, Nucleic Acids 
Res. 1 997, 25(1 7):3389-402). 

The use of an antisense delivery method employing a short peptide 
vector, termed MPG (27 residues), is also contemplated. The MPG peptide contains a 
25 hydrophobic domain derived from the fusion sequence of HIV gp41 and a hydrophilic 
domain from the nuclear localization sequence of SV40 T-antigen (Morris et a/., 
Nucleic Acids Res. 1997 Jul 15^5(14)2730-6). It has been demonstrated that several 
molecules of the MPG peptide coat the antisense oligonucleotides and can be delivered 
into cultured mammalian cells in less than 1 hour with relatively high efficiency (90%). 
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Further, the interaction with MPG strongly increases both the stability of the 
oligonucleotide to nuclease and the ability to cross the plasma membrane. 

According to another embodiment of the invention, the polynucleotide 
compositions described herein are used in the design and preparation of ribozyme 
5 molecules for inhibiting expression of the tumor polypeptides and proteins of the 
present invention in tumor cells. Ribozymes are RNA-protein complexes lhat cleave 
nucleic acids in a site-specific fashion. Ribozymes have specific catalytic domains that 
possess endonuclease activity (Kim and Cech, Proc Natl Acad Sci U S A. 1987 
Dec;84(24):8788-92; Forster and Symons, Cell. 1987 Apr 24;49(2):21 1-20). For 
10 example, a large number of ribozymes accelerate phosphoester transfer reactions with a 
high degree of specificity, often cleaving only one of several phosphoesters in an 
oligonucleotide substrate (Cech et al, Cell. 1981 Dec;27(3 Pt 2):487-96; Michel and 
Westhof, J Mol Biol. 1990 Dec 5;216(3):585-610; Reinhold-Hurek and Shub, Nature. 
1 992 May 1 4;357(6374): 1 73-6). This specificity has been attributed to the requirement 
15 that the substrate bind via specific base-pairing interactions to the internal guide 
sequence ("IGS") of the ribozyme prior to chemical reaction. 

Six basic varieties of naturally-occurring enzymatic RNAs are known 
presently. Each can catalyze the hydrolysis of RNA phosphodiester bonds in trans (and 
thus can cleave other RNA molecules) under physiological conditions. In general, 
20 enzymatic nucleic acids act by first binding' to a target RNA. Such binding occurs 
through the target binding portion of a enzymatic nucleic acid which is held in close 
proximity to an enzymatic portion of the molecule that acts to cleave the target RNA. 
Thus, the enzymatic nucleic acid first recognizes and then binds a target RNA through 
complementary base-pairing, and once bound to the correct site, acts enzymatically to 
25 cut the target RNA. Strategic cleavage of such a target RNA will destroy its ability to 
direct synthesis of an encoded protein After an enzymatic nucleic acid has bound and 
cleaved its RNA target, it is released from that RNA to search for another target and can 
repeatedly bind and cleave new targets. 

The enzymatic nature of a ribozyme is advantageous over many 
30 technologies, such as antisense technology (where a nucleic acid molecule simply binds 
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to a nucleic acid target to block its translation) since the concentration of ribozyme 
necessary to affect a therapeutic treatment is lower than that of an antisense 
oligonucleotide. This advantage reflects the ability of the ribozyme to act 
enzymatically. Thus, a single ribozyme molecule is able to cleave many molecules of 
5 target RNA. In addition, the ribozyme is a highly specific inhibitor, with the specificity 
of inhibition depending not only on the base pairing mechanism of binding to the target 
RNA, but also on the mechanism of target RNA cleavage. Single mismatches, or base- 
substitutions, near the site of cleavage can completely eliminate catalytic activity of a 
ribozyme. Similar mismatches in antisense molecules do not prevent their action 
10 (Woolf etal, Proc Natl Acad Sci U S A 1992 Aug 15;89(16):7305-9). Thus, the 
specificity of action of a ribozyme is greater than that of an antisense oligonucleotide 
binding the same RNA site. 

The enzymatic nucleic acid molecule may be formed in a hammerhead, 
hairpin, a hepatitis 8 virus, group I intron or RNaseP RNA (in association with an RNA 
15 guide sequence) or Neurospora VS RNA motif Examples of hammerhead motifs are 
described by Rossi et at Nucleic Acids Res. 1992 Sep 1 1;20(17):4559-65. Examples of 
hairpin motifs are described by Hampel et al <Eur. Pat. Appl. Publ. No. EP 0360257), 
Hampel and Tritz, Biochemistry 1989 Jun 13;28(12):4929-33; Hampel etal, Nucleic 
Acids Res. 1990 Jan 25;18(2):299-304 and U. S. Patent 5,631,359. An example of Hie 
20 hepatitis 5 virus motif is described by Perrbtta and Been, Biochemistry. 1992 Dec 
1;3 1(47): 11 843-52; an example of the RNaseP motif is described by Guerrier-Takada 
etal, Cell. 1983 Dec;35(3 Pt 2):849-57; Neurospora VS RNA ribozyme motif is 
described by Collins (Saville and Collins, Cell. 1990 May 18;61(4):685-96; Saville and 
. Collins, Proc Natl Acad Sci U S A. 1991 Oct l;88(19):8826-30; Collins and Olive, 
25 Biochemistry. 1993 Mar 23;32(1 1):2795-9); and an example of the Group I intron is. 
described in (U. S. Patent 4,987,071). All that is important in an enzymatic nucleic acid 
molecule of this invention is that it has a specific substrate binding site which is 
complementary to one or more of the target gene RNA regions, and that it have 
nucleotide sequences within or surrounding that substrate binding site which impart an 
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RNA cleaving activity to the molecule. Thus the ribozyme constructs need not be 
limited to specific motifs mentioned herein. 

Ribozymes may be designed as described in Int. Pat. Appl. Publ. No. 
WO 93/23569 and Int Pat. Appl. Publ. No. WO 94/02595, each specifically 
5 incorporated herein by reference) and synthesized to be tested in vitro and in vivo, as 
described. Such ribozymes can also be optimized for delivery. While specific 
examples are provided, those in the art will recognize that equivalent RNA targets in 
other species can be utilized when necessary. 

Ribozyme activity can be optimized by altering the length of the 

10 ribozyme binding arms, or chemically synthesizing ribozymes with modifications that 
prevent their degradation by serum ribonucleases (see e.g., Int. Pat. Appl. Publ. No. WO 
92/07065; Int Pat. Appl. Publ. No. WO 93/15187; Int. Pat. Appl. Publ. No. WO 
91/03162; Eur. Pat. Appl. Publ. No. 92110298.4; U. S. Patent 5,334,711; and Int. Pat 
Appl. Publ. No. WO 94/13688, which describe various chemical modifications that can 

15 be made to the sugar moieties of enzymatic RNA molecules), modifications which 
enhance their efficacy in cells, and removal of stem II bases to shorten RNA synthesis 
times and reduce chemical requirements. 

Sullivan etal (Int. Pat. Appl. Publ. No. WO 94/02595) describes the 
general methods for delivery of enzymatic RNA molecules. Ribozymes may be 

20 administered to cells by a variety of methbds known to those familiar to the art, 
including, but not restricted to, encapsulation in liposomes, by iontophoresis, or by 
incorporation into other vehicles, such as hydrogels, cyclodextrins, biodegradable 
nanocapsules, and bioadhesive microspheres. For some indications, ribozymes may be 
directly delivered ex vivo to cells or tissues with or without the aforementioned vehicles. 

25 Alternatively, the RNA/vehicle combination may be locally delivered by direct 
inhalation, by direct injection or by use of a catheter, infusion pump or stint. Other 
routes of delivery include, but are not limited to, intravascular, intramuscular, 
subcutaneous or joint injection, aerosol inhalation, oral (tablet or pill form), topical, 
systemic, ocular, intraperitoneal and/or intrathecal delivery. More detailed descriptions 

30 of ribozyme delivery and administration are provided in Int. Pat Appl. Publ. No. WO 
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94/02595 and Int. Pat. Appl. Publ. No. WO 93/23569, each specificaUy incorporated 
herein by reference. 

Another means of accumulating high concentrations of a ribozyme(s) 
within cells is to incorporate the ribozyme-encoding sequences into a DNA expression 
5 vector. Transcription of the ribozyme sequences are driven from a promoter for 
eukaryotic RNA polymerase I (pol I), RNA polymerase II (pol II), or RNA polymerase 
III (pol HI). Transcripts from pol II or pol III promoters will be expressed at high levels 
in all cells; the levels of a given pol H promoter in a given cell type will depend on the 
nature of the gene regulatory sequences (enhancers, silencers, etc.) present nearby. 

10 Prokaryotic RNA polymerase promoters may also be used, providing that the 
prokaiyotic RNA polymerase enzyme is expressed in the appropriate cells. Ribozymes 
expressed from such promoters have been shown to function in mammalian cells. Such 
transcription units can be incorporated into a variety of vectors for introduction into 
mammalian cells, including but not restricted to, plasmid DNA vectors, viral DNA 

15 vectors (such as adenovirus or adeno-associated vectors), or viral RNA vectors (such as 
retroviral, semliki forest virus, sindbis virus vectors). 

In another embodiment of the invention, peptide nucleic acids (PNAs) 
compositions are provided. PNA is a DNA mimic in which the nucleobases are 
attached to a pseudopeptide backbone (Good and Nielsen, Antisense Nucleic Acid Drug 

20 Dev. 1997 7(4) 431-37). PNA is able to« be utilized in a number methods that 
traditionally have used RNA or DNA. Often PNA sequences perform better in 
techniques than the corresponding RNA or DNA sequences and have utilities that are 
not inherent to RNA or DNA. A review of PNA including methods of making, 
characteristics of, and methods of using, is provided by Corey {Trends Biotechnol 1997 

25 Jun;15(6):224-9). As such, in certain embodiments, one may prepare PNA sequences 
that are complementary to one or more portions of the ACE mRNA sequence, and such 
PNA compositions may be used to regulate, alter, decrease, or reduce the translation of 
ACE-specific mRNA, and thereby alter the level of ACE activity in a host cell to which 
such PNA compositions have been administered. 
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PNAs have 2-aminoethyI-glycine linkages replacing the normal 
phosphodiester backbone of DNA (Nielsen et aL 9 Science 1991 Dec 6£54(5037):1497- 
500; Hanvey et aL, Science. 1992 Nov 27;258(5087):1481-5; Hyrup and Nielsen, 
Bioorg Med Chem. 1996 Jan;4(l):5-23). This chemistry has three important 

5 consequences: firstly, in contrast to DNA or phosphorothioate oligonucleotides, PNAs 
aie neutral molecules; secondly, PNAs are achiral, which avoids the need to develop a 
stereoselective synthesis; and thirdly, PNA synthesis uses standard Boc or Fmoc 
protocols for solid-phase peptide synthesis, although other methods, including a 
modified Merrifield method, have been used. 

10 PNA monomers or ready-made oligomers are commercially available 

fiom PerSeptive Biosystems (Framingham, MA). PNA syntheses by either Boc or 
Fmoc protocols are straightforward using manual or automated protocols (Norton et al, 
Bioorg Med Chem. 1995 Apr;3(4):437-45). The manual protocol lends itself to the 
production of chemically modified PNAs or the simultaneous synthesis of families of 

15 closely related PNAs. 

As with peptide synthesis, the success of a particular PNA synthesis will 
depend on the properties of the chosen sequence. For example, while in theory PNAs 
can incorporate any combination of nucleotide bases, the presence of adjacent purines 
can lead to deletions of one or more residues in the product. In expectation of this 

20 difficulty, it is suggested that, in producing PNAs with adjacent purines, one should 
repeat the coupling of residues likely to be added inefficiently. This should be followed 
by the purification of PNAs by reverse-phase high-pressure liquid chromatography, 
providing yields and purity of product similar to those observed during the synthesis of 
peptides. 

25 Modifications of PNAs for a given application may be accomplished by 

coupling amino acids during solid-phase synthesis or by attaching compounds that 
contain a carboxylic acid group to the exposed N-terminal amine. Alternatively, PNAs 
can be modified after synthesis by coupling to an introduced lysine or cysteine. The 
ease with which PNAs can be modified facilitates optimization for better solubility or 

30 for specific functional requirements. Once synthesized, the identity of PNAs and their 
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derivatives can be confirmed by mass spectrometry. Several studies have made and 
utilized modifications of PNAs (for example, Norton et al, Bioorg Med Chem. 1995 
Apr,3(4):437-45; Petersen et al, J Pept Sci. 1995 May-Jun; 1(3): 175-83; Oram et al, 
Biotechniques. 1995 Sep;19(3):472-80; Footer et al, Biochemistry. 1996 Aug 
5 20;35(33): 10673-9; Griffith et al, Nucleic Acids Res. 1995 Aug ll;23(15):3003-8; 
Pardridge et al, Proc Natl Acad Sci U S A 1995 Jun 6;92(12):5592-6; Boffa et al, 
Proc Nad Acad Sci USA. 1995 Mar 14;92(6):1 901-5; Gambacorti-Passerini et al, 
Blood. 1996 Aug 15;88(4):1411-7; Armitage et al, Proc Natl Acad Sci USA. 1997 
Nov ll;94(23):12320-5; Seeger et al, Biotechniques. 1997 Sep;23(3):512-7). U.S. 
10 Patent No. 5,700,922 discusses PNA-DNA-PNA chimeric molecules and their uses in 
diagnostics, modulating protein in organisms, and treatment of conditions susceptible to 
therapeutics. 

Methods of characterizing the antisense binding properties of PNAs are 
discussed in Rose (Anal Chem. 1993 Dec 15;65(24):3545-9) and Jensen et al 
15 (Biochemistry. 1 997 Apr 22;36(16):5072-7). Rose uses capillary gel electrophoresis to 
determine binding of PNAs to their complementary oligonucleotide, measuring the 
relative binding kinetics and stoichiomeuy. Similar types of measurements were made 
by Jensen et al using BIAcore™ technology. 

Other applications of PNAs that have been described and will be 
20 apparent to the skilled artisan include use in DNA strand invasion, antisense inhibition, 
mutational analysis, enhancers of transcription, nucleic acid purification, isolation of 
transcriptionally active genes, blocking of transcription factor binding, genome 
cleavage, biosensors, in situ hybridization, and the like. 

Polynucleotide Id entification, Characterization and Expression 
25 Polynucleotides compositions of the present invention may be identified, 

prepared and/or manipulated using any of a variety of well established techniques (see 
generally, Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratories, Cold Spring Harbor, NY, 1989, and other like references). For 
example, a polynucleotide may be identified, as described in more detail below, by 
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screenings microarray of cDNAs for tumor-associated expression (le., expression that 
is at least two fold greater in a tumor than in normal tissue, as determined using a 
representative assay provided herein). Such screens may be performed, for example, 
using the microarray technology of Affymetrix, Inc. (Santa Clara, CA) according to the 
5 manufacturer's instructions (and essentially as described by Schena et al., Proc. Natl. 
Acad Sci. USA 93:10614-10619, 1996 and Heller et al., Proc. Natl. Acad. Sci. USA 
94:2150-2155, 1997). Alternatively, polynucleotides may be amplified from cDNA 
prepared from cells expressing the proteins described herein, such as tumor cells. 

Many template dependent processes are available to amplify a target 
10 sequences of interest present in a sample. One of the best known amplification methods 
is the polymerase chain reaction (PCR™) which is described in detail in U.S. Patent 
Nos. 4,683,195, 4,683,202 and 4,800,159, each of which is incorporated herein by 
reference in its entirety. Briefly, in PCR™, two primer sequences are prepared which 
are complementary to regions on opposite complementary strands of the target 
15 sequence. An excess of deoxynucleoside triphosphates is added to a reaction mixture 
along with a DNA polymerase (e.g., Tag polymerase). If the target sequence is present 
in a sample, the primers will bind to the target and the polymerase will cause the 
primers to be extended along the target sequence by adding on nucleotides. By raising 
and lowering the temperature of the reaction mixture, the extended primers will 
20 dissociate from the target to form reaction products, excess primers will bind to the 
target and to the reaction product and the process is repeated. Preferably reverse 
transcription and PCR™ amplification procedure may be performed in order to quantify 
the amount of mRNA amplified. Polymerase chain reaction methodologies are well 
known in the art. 

25 Any of a number of other template dependent processes, many of which 

are variations of the PCR ™ amplification technique, are readily known and available in 
the art. Illustratively, some such methods include the ligase chain reaction (referred to 
as LCR), described, for example, in Eur. Pat Appl. Publ. No. 320,308 and U.S. Patent 
No. 4,883,750; Qbeta Repticase, described in PCT Intl. Pat. Appl. Publ. No. 

30 PCTAJS87/00880; Strand Displacement Amplification (SDA) and Repair Chain 
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Reaction (RCR). Still other amplification methods are described in Great Britain Pat. 
AppL No. 2 202 328, and in PCT Intl. Pat Appl. Publ. No. PCT/US89/01025. Other 
nucleic acid amplification procedures include transcription-based amplification systems 
(TAS) (PCT Intl. Pat. Appl. Publ. No. WO 88/10315), including nucleic acid sequence 
5 based amplification (NASBA) and 3SR. Eur. Pat. Appl. Publ. No. 329,822 describes a 
nucleic acid amplification process involving cyclically synthesizing single-stranded 
RNA ("ssRNA"), ssDNA, and double-stranded DNA (dsDNA). PCT Intl. Pat. Appl. 
Publ. No. WO 89/06700 describes a nucleic acid sequence amplification scheme based 
on the hybridization of a promoter/primer sequence to a target single-stranded DNA 
10 ("ssDNA") followed by transcription of many RNA copies of the sequence. Other 
amplification methods such as "RACE" (Frohman, 1990), and "one-sided PCR" (Ohara, 
1 989) are also well-known to those of skill in the art. 

An amplified portion of a polynucleotide of the present invention may be 
used to isolate a full length gene from a suitable library (e.g., a tumor cDNA library) 

15 using well known techniques. Within such techniques, a library (cDNA or genomic) is 
screened using one or more polynucleotide probes or primers suitable for amplification. 
Preferably, a library is size-selected to include larger molecules. Random primed 
libraries may also be preferred for identifying 5' and upstream regions of genes. 
Genomic libraries are preferred for obtaining introns and extending 5 1 sequences. 

20 For hybridization techniques, a 1 partial sequence may be labeled (e.g., by 

nick-translation or end-labeling with 32 P) using well known techniques. A bacterial or 
bacteriophage library is then generally screened by hybridizing filters containing 
denatured bacterial colonies (or lawns containing phage plaques) with the labeled probe 
(see Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 

25 Laboratories, Cold Spring Harbor, NY, 1989). Hybridizing colonies or plaques are 
selected and expanded, and the DNA is isolated for further analysis. cDNA clones may 
be analyzed to determine the amount of additional sequence by, for example, PCR using 
a primer from the partial sequence and a primer from the vector. Restriction maps and 
partial sequences may be generated to identify one or more overlapping clones. The 

30 complete sequence may then be determined using standard techniques, which may 
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involve generating a series of deletion clones. The resulting overlapping sequences can 
then assembled into a single contiguous sequence. A full length cDNA molecule can be 
generated by ligating suitable fragments, using well known techniques. 

Alternatively, amplification techniques, such as those described above, 

5 can be useful for obtaining a full length coding sequence from a partial cDNA sequence. 
One such amplification technique is inverse PCR {see Triglia et al., Nucl Acids Res. 
75:8186, 1988), which uses restriction enzymes to generate a fragment in the known 
region of the gene. The fragment is then circularized by intramolecular ligation and 
used as a template for PCR with divergent primers derived from the known region. 

10 Within an alternative approach, sequences adjacent to a partial sequence may be 
retrieved by amplification with a primer to a linker sequence and a primer specific to a 
known region. The amplified sequences are typically subjected to a second round of 
amplification with the same linker primer and a second primer specific to the known 
region. A variation on this procedure, which employs two primers that initiate 

15 extension in opposite directions from the known sequence, is described in WO 
96/38591. Another such technique is known as "rapid amplification of cDNA ends" or 
RACE. This technique involves the use of an internal primer and an external primer, 
which hybridizes to a polyA region or vector sequence, to identify sequences that are 5 f 
and 3 % of a known sequence. Additional techniques include capture PCR (Lagerstrom et 

20 al., PCR Methods Applic. 1:1 11-19, 1991) and walking PCR (Parker et al., Nucl Acids. 
Res. 79:3055-60, 1991). Other methods employing amplification may also be employed 
to obtain a full length cDNA sequence. 

In certain instances, it is possible to obtain a full length cDNA sequence 
by analysis of sequences provided in an expressed sequence tag (EST) database, such as 

25 that available from GenBank. Searches for overlapping ESTs may generally be 
performed using well known programs (e.g., NCBI BLAST searches), and such ESTs 
may be used to generate a contiguous full length sequence. Full length DNA sequences 
may also be obtained by analysis of genomic fragments. ? 

In other embodiments of the invention, polynucleotide sequences or 

30 fragments thereof which encode polypeptides of the invention, or fusion proteins or 
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functional equivalents thereof, may be used in recombinant DNA molecules to direct 
expression of a polypeptide in appropriate host cells. Due to the inherent degeneracy of 
the genetic code, other DNA sequences that encode substantially the same or a 
functionally equivalent amino acid sequence may be produced and these sequences may 
5 be used to clone and express a given polypeptide. 

As will be understood by those of skill in the art, it may be advantageous 
in some instances to produce polypeptide-encoding nucleotide sequences possessing 
non-naturally occurring codons. For example, codons preferred by a particular 
prokaryotic or eukaryotic host can be selected to increase the rate of protein expression 
10 or to produce a recombinant RNA transcript having desirable properties, such as a half- 
life which is longer than that of a transcript generated from the naturally occurring 
sequence. 

Moreover, the polynucleotide sequences of the present invention can be 
engineered using methods generally known in the art in order to alter polypeptide 
15 encoding sequences for a variety of reasons, including but not limited to, alterations 
which modify the cloning, processing, and/or expression of the gene product For 
example, DNA shuffling by random fragmentation and PGR reassembly of gene 
fragments and synthetic oligonucleotides may be used to engineer the nucleotide 
sequences. In addition, site-directed mutagenesis may be used to insert new restriction 
20 sites, alter glycosylation patterns, change codbn preference, produce splice variants, or 
introduce mutations, and so forth. 

In another embodiment of the invention, natural, modified, or 
recombinant nucleic acid sequences may be lighted to a heterologous sequence to 
encode a fusion protein. For example, to screen peptide libraries for inhibitors of 
25 polypeptide activity, it may be useful to encode a chimeric protein that can be 
recognized by a commercially available antibody. A fusion protein may also be 
engineered to contain a cleavage site located between the polypeptide-encoding 
sequence and the heterologous protein sequence, so that the polypeptide may be cleaved 
and purified away from the heterologous moiety. 
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Sequences encoding a desired polypeptide may be synthesized, in whole 
or in part, using chemical methods well known in the art (see Caruthers, M. H. et al. 
(1980) NucL Acids Res. Symp. Ser. 215-223, Horn, T. et al. (1980) NucL Acids Res. 
Symp. Ser. 225-232). Alternatively, the protein itself may be produced using chemical 
5 methods to synthesize the amino acid sequence of a polypeptide, or a portion thereof. 
For example, peptide synthesis can be performed using various solid-phase techniques 
(Roberge, J. Y. et al. (1995) Science 269:202-204) and automated synthesis may be 
achieved, for example, using the ABI 431 A Peptide Synthesizer (Perkin Elmer, Palo 
Alto,CA). 

10 A newly synthesized peptide may be substantially purified by preparative 

high performance liquid chromatography (eg., Creighton, T. (1983) Proteins, Structures 
and Molecular Principles, WH Freeman and Co., New York, N.Y.) or other comparable 
techniques available in the art. The composition of the synthetic peptides may be 
confirmed by amino acid analysis or sequencing (e.g., the Edman degradation 

1 5 procedure). Additionally, the amino acid sequence of a polypeptide, or any part thereof, 
may be altered during direct synthesis and/or combined using chemical methods with 
sequences from other proteins, or any part thereof, to produce a variant polypeptide. 

In order to express a desired polypeptide, the nucleotide sequences 
encoding the polypeptide, or functional equivalents, may be inserted into appropriate 

20 expression vector, Le., a vector which cbntains the necessary elements for the 
transcription and translation of the inserted coding sequence. Methods which are well 
known to those skilled in the art may be used to construct expression vectors containing 
sequences encoding a polypeptide of interest and appropriate transcriptional and 
translational control elements. These methods include in vitro recombinant DNA 

25 techniques, synthetic techniques, and in vivo genetic recombination. Such techniques 
are described, for example, in Sambrook, J. et al. (1989) Molecular Cloning, A 
Laboratory Manual, Cold Spring Harbor Press, Plainview, N.Y., and Ausubel, F. M. et 
al. (1989) Current Protocols in Molecular Biology, John Wiley & Sons, New York. 
N.Y. 
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A variety of expression vector/host systems may be utilized to contain 
and express polynucleotide sequences. These include, but are not limited to, 
microorganisms such as bacteria transformed with recombinant bacteriophage, plasmid, 
or cosmid DNA expression vectors; yeast transformed with yeast expression vectors; 
5 insect cell systems infected with virus expression vectors (e.g., baculovirus); plant cell 
systems transformed with virus expression vectors (e.g., cauliflower mosaic virus, 
CaMV; tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or 
pBR322 plasmids); or animal cell systems. 

The "control elements" or "regulatory sequences" present in an 

10 expression vector are those non-translated regions of the vector-enhancers, promoters, 
5* and 3' untranslated regions-which interact with host cellular proteins to carry out 
transcription and translation. Such elements may vary in their strength and specificity. 
Depending on the vector system and host utilized, any number of suitable transcription 
and translation elements, including constitutive and inducible promoters, may be used. 

1 5 For example, when cloning in bacterial systems, inducible promoters such as the hybrid 
lacZ promoter of the PBLUESCRIPT phagemid (Stratagene, La Jolla, Calif.) or 
PSPORT1 plasmid (Gibco BRL, Gaithersburg, MD) and the like may be used. In 
mammalian cell systems, promoters from mammalian genes or from mammalian viruses 
are generally preferred. If it is necessary to generate a cell line that contains multiple 

20 copies of the sequence encoding a polypeptide, vectors based on SV40 or EBV may be 
advantageously used with an appropriate selectable marker. 

In bacterial systems, any of a number of expression vectors may be 
selected depending upon the use intended for the expressed polypeptide. For example, 
when large quantities are needed, for example for the induction of antibodies, vectors 

25 which direct high level expression of fusion proteins that are readily purified may be 
used. Such vectors include, but are not limited to, the multifunctional E coli cloning 
and expression vectors such as BLUESCRIPT (Stratagene), in which the sequence 
encoding the polypeptide of interest may be ligated into tjie vector in frame with 
sequences for the amino-terminal Met and the subsequent 7 residues of .beta.- 

30 galactosidase so that a hybrid protein is produced; pDSf vectors (Van Heeke, G. and S. 
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M. Schuster (1989) J. Biol Chem. 26*5503-5509); and the like. pGEX Vectors 
(Promega, Madison, Wis.) may also be used to express foreign polypeptides as fusion 
proteins with glutathione S-transferase (GST). In general, such fusion proteins are 
soluble and can easily be purified from lysed cells by adsorption to glutathione-agarose 
5 beads followed by elution in the presence of free glutathione. Proteins made in such 
systems may be designed to include heparin, thrombin, or factor XA protease cleavage 
sites so that the cloned polypeptide of interest can be released from the GST moiety at 
will. 

In the yeast, Saccharomyces cerevisiae, a number of vectors containing 

10 constitutive or inducible promoters such as alpha factor, alcohol oxidase, and PGH may 
be used. For reviews, see Ausubel et al. (supra) and Grant et al. (1987) Methods 
Ertzymol. 153:516-544. 

In cases where plant expression vectors are used, the expression of 
sequences encoding polypeptides may be driven by any of a number of promoters. For 

15 example, viral promoters such as the 35S and 19S promoters of CaMV may be used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. 
(1987) EMBO J. 6:307-31 1. Alternatively, plant promoters such as the small subunit of 
RUBISCO or heat shock promoters may be used (Coruzzi, G. et al. (1984) EMBO J. 
3:1671-1680; Broglie, R. et al. (1984) Science 22*838-843; and Winter, J. et al. (1991) 

20 Results ProbL Cell Differ. 1 7:85-105). These constructs can be introduced into plant 
cells by direct DNA transformation or pathogen-mediated transfection. Such techniques 
are described in a number of generally available reviews (see, for example, Hobbs, S. or 
Murry, L. E. in McGraw Hill Yearbook of Science and Technology (1992) McGraw 
Hill, New York, N.Y.; pp. 191-196). 

25 An insect system may also be used to express a polypeptide of interest. 

For example, in one such system, Autographa californica nuclear polyhediosis virus 
(AcNPV) is used as a vector to express foreign genes in Spodoptera frugipeida cells or 
in Trichoplusia larvae. The sequences encoding the polypeptide may be cloned into a 
non-essential region of the virus, such as the polyhedrin gene, and placed under control 
30 of the polyhedrin promoter. Successful insertion of the polypeptide-eneoding sequence 
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will render the polyhedrin gene inactive and produce recombinant virus lacking coat 
protein. The recombinant viruses may then be used to infect, for example, S. frugiperda 
cells or Trichoplusia larvae in which the polypeptide of interest may be expressed 
(Engelhard, E. K. et al. (1994) Proc. Natl Acad, Set 91 :3224-3227). 
5 In mammalian host cells, a number of viral-based expression systems are 

generally available. For example, in cases where an adenovirus is used as an expression 
vector, sequences encoding a polypeptide of interest may be ligated into an adenovirus 
transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used 

10 to obtain a viable virus which is capable of expressing the polypeptide in infected host 
cells (Logan, J. and Shenk, T. (1984) Proc. Natl Acad Set. 57:3655-3659). In addition, 
transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, may be used 
to increase expression in mammalian host cells. 

Specific initiation signals may also be used to achieve more efficient 

15 translation of sequences encoding a polypeptide of interest. Such signals include the 
ATG initiation codon and adjacent sequences. In cases where sequences encoding the 
polypeptide, its initiation codon, and upstream sequences are inserted into the 
appropriate expression vector, no additional transcriptional or translational control 
signals may be needed. However, in cases where only coding sequence, or a portion 

20 thereof, is inserted, exogenous translational control signals including the ATG initiation 
codon should be provided. Furthermore, the initiation codon should be in the correct 
reading frame to ensure translation of the entire insert. Exogenous translational 
elements and initiation codons may be of various origins, both natural and synthetic. 
The efficiency of expression may be enhanced by the inclusion of enhancers which are 

25 appropriate for the particular cell system which is used, such as those described in the 
literature (Scharf,D. etal. (1994) Results Probl Cell Differ. 20:125-162). 

In addition, a host cell strain may be chosen for its ability to modulate 
the expression of the inserted sequences or to process the expressed protein in the 
desired fashion. Such modifications of the polypeptide include, but are not limited to, 

30 acetylation, carboxylation. glycosylation, phosphorylation, lipidation, and acylation. 
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Post-translational processing which cleaves a "prepro" form of the protein may also be 
used to facilitate correct insertion, folding and/or function. Different host cells such as 
CHO, COS, HeLa, MDCK, HEK293, and WD8, which have specific cellular machinery 
and characteristic mechanisms for such post-translational activities, may be chosen to 
5 ensure the correct modification and processing of the foreign protein. 

For long-term, high-yield production of recombinant proteins, stable 
expression is generally preferred. For example, cell lines which stably express a 
polynucleotide of interest may be transformed using expression vectors which may 
contain viral origins of replication and/or endogenous expression elements and a 
10 selectable marker gene on the same or on a separate vector. Following the introduction 
of the vector, cells may be allowed to grow for 1-2 days in an enriched media before 
they are switched to selective media. The purpose of the selectable marker is to confer 
resistance to selection, and its presence allows growth and recovery of cells which 
successfully express the introduced sequences. Resistant clones of stably transformed 
15 cells may be proliferated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed 
cell lines. These include, but are not limited to, the herpes simplex virus thymidine 
kinase (Wigler, M. et al. (1977) Cell ii:223-32) and adenine phosphoribosyltransferase 
(Lowy, 1. et al. (1990) Cell 22:817-23) genes which can be employed in tLsup.- or 
20 aprtsup, cells, respectively. Also, antimetabolite, antibiotic or herbicide resistance can 
be used as the basis for selection; for example, dhfr which confers resistance to 
methotrexate (Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. 77:3567-70); npt, which 
confers resistance to the aminoglycosides, neomycin and G-418 (Colbere-Garapin, F. et 
al (1981) J. Mol. Biol. 150:1-14); and als or pat, which confer resistance to 
25 chlorsulfuron and phosphinotricin acetyltransferase, respectively (Murry, supra). 
Additional selectable genes have been described, for example, trpB, which allows cells 
to utilize indole in place of tryptophan, or hisD, which allows cells to utilize histinol in 
place of histidine (Hartman, S. C. and R. C. Mulligan (1988) Proc. Natl. Acad. Sci. 
55:8047-51). The use of visible markers has gained popularity with such markers as 
30 anthocyanrns, beta-glucuronidase and its substrate GUS, and luciferase and its substrate 
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luciferin, being widely used not only to identify transformants, but also to quantify the 
amount of transient or stable protein expression attributable to a specific vector system 
(Rhodes, C. A. et al. (1995) Methods Mol Biol 55:121-131). 

Although the presence/absence of marker gene expression suggests that 
the gene of interest is also present, its presence and expression may need to be 
confirmed. For example, if the sequence encoding a polypeptide is inserted within a 
marker gene sequence, recombinant cells containing sequences can be identified by (he 
absence of marker gene function. Alternatively, a marker gene can be placed in tandem 
with a polypeptide-encoding sequence under the control of a single promoter. 
Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem gene as well. 

Alternatively, host cells that contain and express a desired 
polynucleotide sequence may be identified by a variety of procedures known to those of 
skill in the art. These procedures include, but are not limited to, DNA-DNA or DNA- 
RNA hybridizations and protein bioassay or immunoassay techniques which include, 
for example, membrane, solution, or chip based technologies for the detection and/or 
quantification of nucleic acid or protein. 

A variety of protocols for detecting and measuring the expression of 
polynucleotide-encoded products, using either polyclonal or monoclonal antibodies 
specific for the product are known in the art. Examples include enzyme-linked 
immunosorbent assay (EUSA), radioimmunoassay (RIA), and fluorescence activated 
cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on a given polypeptide may be 
preferred for some applications, but a competitive binding assay may also be employed. 
These and other assays are described, among other places, in Hampton, R. et al. (1990; 
Serological Methods, a Laboratory Manual, APS Press, St Paul. Minn.) and Maddox, D. 
E. et al. (1983; J. Exp. Med 755:121 1-1216). 

A wide variety of labels and conjugation techniques are known by those 
skilled in the art and may be used in various nucleic acid and amino acid assays. Means 
for producing labeled hybridization or PCR probes for detecting sequences related to 

66 



WO 02/00174 



PCT/US01/21065 



polynucleotides include oligolabeling, nick translation, end-labeling or PCR 
• amplification using a labeled nucleotide. Alternatively, the sequences, or any portions 
thereof may be cloned into a vector for the production of an mRNA probe. Such vectors 
are known in the art, are commercially available, and may be used to synthesize RNA 
5 probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 
and labeled nucleotides. These procedures may be conducted using a variety of 
commercially available kits. Suitable reporter molecules or labels, which may be used 
include radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents 
as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 
10 Host ceils transformed with a polynucleotide sequence of interest may be 

cultured under conditions suitable for the expression and recovery of the protein from 
cell culture. The protein produced by a recombinant cell may be secreted or contained 
intracellularly depending on the sequence and/or the vector used. As will be understood 
by those of skill in the art, expression vectors containing polynucleotides of the 
15 invention may be designed to contain signal sequences which direct secretion of the 
encoded polypeptide through a prokaryotic or eukaryotic cell membrane. Other 
recombinant constructions may be used to join sequences encoding a polypeptide of 
interest to nucleotide sequence encoding a polypeptide domain which will facilitate 
purification of soluble proteins. Such purification facilitating domains include, but are 
20 not limited to, metal chelating peptides such as histidine-tryptophan modules that allow 
purification on immobilized metals, protein A domains that allow purification on 
immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity 
purification system (Immunex Corp., Seattle, Wash.). The inclusion of cleavable linker 
sequences such as those specific for Factor XA or enterokinase (Invitrogen. San Diego, 
25 Calif.) between the purification domain and the encoded polypeptide may be used to 
facilitate purification. One such expression vector provides for expression of a fusion 
protein containing a polypeptide of interest and a nucleic acid encoding 6 histidine 
residues preceding a thioredoxin or an enterokinase cleavage site. The histidine residues 
facilitate purification on IMIAC (immobilized metal ion affinity chromatography) as 
30 described in Porath, J. et al. (1992, Prot. Exp. Purif. 3:263-281) while the enterokinase 
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cleavage site provides a means for purifying the desired polypeptide from the fusion 
protein. A discussion of vectors which contain fusion proteins is provided in Kroll, D. J. 

; et aL (1993; DNA Cell Biol 72:441-453). 

In addition to recombinant production methods, polypeptides of the 
5 invention, and fragments thereof, may be produced by direct peptide synthesis using 
solid-phase techniques (Merrifield J. (1963) J. Am. Chem. Soc. 55:2149-2154). Protein 
synthesis may be performed using manual techniques or by automation. Automated 
synthesis may be achieved, for example, using Applied Biosystems 431 A Peptide 
Synthesizer (Perkin Elmer). Alternatively, various fragments may be chemically 

10 synthesized separately and combined using chemical methods to produce the full length 
molecule. 

Antibody Compositions, Fragments Thereof and Other Binding Agents 

According to another aspect, the present invention further provides 
binding agents, such as antibodies and antigen-binding fragments thereof, that exhibit 

1 5 immunological binding to a tumor polypeptide disclosed herein, or to a portion, variant 
or derivative thereof. An antibody, or antigen-binding fragment thereof, is said to 
"specifically bind," "immunogically bind/ 5 and/or is "immunologically reactive" to a 
polypeptide of the invention if it reacts at a detectable level (within, for example, an 
ELISA assay) with the polypeptide, and does not react detectably with unrelated 

20 polypeptides under similar conditions. 

Immunological binding, as used in this context, generally refers to the 
non-covalent interactions of the type which occur between an immunoglobulin 
molecule and an antigen for which the immunoglobulin is specific. The strength, or 
affinity of immunological binding interactions can be expressed in terms of the 

25 dissociation constant (K<j) of the interaction, wherein a smaller K<j represents a greater 
affinity. Immunological binding properties of selected polypeptides can be quantified 
using methods well known in the art. One such method entails measuring the rates of 
antigen-binding site/antigen complex formation and dissociation, wherein those rates 
depend on the concentrations of the complex partners, the affinity of the interaction, and 



68 



WO 02/00174 



PCT/US01/21065 



on geometric parameters that equally influence the rate in both directions. Thus, both 
the "on rate constant" and the "off rate constant" (Kotr) can be determined by 
calculation of the concentrations and the actual rates of association and dissociation. 
The ratio of K^M™ enables cancellation of all parameters not related to affinity, and is 
5 thus equal to the dissociation constant K d . See, generally, Davies et al. (1990) Annual 
Rev. Biochem. 59:439-473. 

An "antigen-binding site," or "binding portion" of an antibody refers to 
the part of the immunoglobulin molecule that participates in antigen binding. The 
antigen binding site is formed by amino acid residues of the N-terminal variable ("V") 
10 regions of the heavy ("H") and light ("L") chains. Three highly divergent stretches 
within the V regions of the heavy and light chains are referred to as "hypervariable 
regions" which are interposed between more conserved flanking stretches known as 
"fennework regions," or "FRs". Thus the term "FR" refers to amino acid sequences 
which are naturally found between and adjacent to hypervariable regions in 
15 immunoglobulins. In an antibody molecule, the three hypervariable regions of a light 
chain and the three hypervariable regions of a heavy chain are disposed relative to each 
other in three dimensional space to form an antigen-binding surface. The antigen- 
binding surface is complementary to the three-dimensional surface of a bound antigen, 
and the three hypervariable regions of each of the heavy and light chains are referred to 
20 as "complementarity-determining regions," or '"CDRs." 

Binding agents may be further capable of differentiating between patients 
with and without a cancer, such as lung cancer, using the representative assays provided 
herein. For example, antibodies or other binding agents that bind to a tumor protein 
will preferably generate a signal indicating the presence of a cancer in at least about 
25 20% of patients with the disease, more preferably at least about 30% of patients. 
Alternatively, or in addition, the antibody will generate a negative signal indicating the 
absence of the disease in at least about 90% of individuals without the cancer. To 
determine whether a binding agent satisfies this requirement^ biological samples (e.g., 
blood, sera, sputum, urine and/or tumor biopsies) from patients with and without a 
30 cancer (as determined using standard clinical tests) may be assayed as described herein 
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for the presence of polypeptides that bind to the binding agent Preferably, a statistically 
significant number of samples with and without the disease will be assayed. Each 
binding agent should satisfy the above criteria; however, those of ordinary skill in the 
art will recognize that binding agents may be used in combination to improve 
5 sensitivity. 

Any agent that satisfies the above requirements may be a binding agent. 
For example, a binding agent may be a ribosome, with or without a peptide component, 
an RNA molecule or a polypeptide. In a preferred embodiment, a binding agent is an 
antibody or an antigen-binding fragment thereof. Antibodies may be prepared by any of 

10 a variety of techniques known to those of ordinary skill in the art. See, e.g., Harlow and 
Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 1988. In 
general, antibodies can be produced by cell culture techniques, including the generation 
of monoclonal antibodies as described herein, or via transfection of antibody genes into 
suitable bacterial or mammalian cell hosts, in order to allow for the production of 

15 recombinant antibodies. In one technique, an immunogen comprising the polypeptide is 
initially injected into any of a wide variety of mammals {e.g., mice, rats, rabbits, sheep 
or goats). In this step, the polypeptides of this invention may serve as the immunogen 
without modification. Alternatively, particularly for relatively short polypeptides, a 
superior immune response may be elicited if the polypeptide is joined to a carrier 

20 protein, such as bovine serum albumin or keyhole limpet hemocy anin. The immunogen 
is injected into the animal host, preferably according to a predetermined schedule 
incorporating one or more booster immunizations, and the animals are bled periodically. 
Polyclonal antibodies specific for the polypeptide may then be purified from such 
antisera by, for example, affinity chromatography using the polypeptide coupled to a 

25 suitable solid support. 

Monoclonal antibodies specific for an antigenic polypeptide of interest 
may be prepared, for example, using the technique of Kohler and Milstein, Eur. J. 
Immunol. 6:5 11-519, 1976, and improvements thereto. Briefly, these methods involve 
the preparation of immortal cell lines capable of producing antibodies having the 

30 desired specificity (le., reactivity with the polypeptide of interest). Such cell lines may 
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be produced, for example, from spleen cells obtained from an animal immunized as 
described above. The spleen cells are then immortalized by, for example, fusion with a 
myeloma cell fusion partner, preferably one that is syngeneic with the immunized 
animal. A variety of fusion techniques may be employed. For example, the spleen cells 
5 and myeloma cells may be combined with a nonionic detergent for a few minutes and 
then plated at low density on a selective medium that supports the growth of hybrid 
cells, but not myeloma cells. A preferred selection technique uses HAT (hypoxanthine, 
aminopterin, thymidine) selection. After a sufficient time, usually about 1 to 2 weeks, 
colonies of hybrids are observed. Single colonies are selected and their culture 
10 supernatants tested for binding activity against the polypeptide. Hybridomas having 
high reactivity and specificity are preferred. 

Monoclonal antibodies may be isolated from the supernatants of growing 
hybridoma colonies. In addition, various techniques may be employed to enhance the 
yield, such as injection of the hybridoma cell line into the peritoneal cavity of a suitable 
15 vertebrate host, such as a mouse. Monoclonal antibodies may then be harvested from 
the ascites fluid or the blood. Contaminants may be removed from the antibodies by 
conventional techniques, such as chromatography, gel filtration, precipitation, and 
extraction. The polypeptides of this invention may be used in the purification process 
in, for example, an affinity chromatography step. 
20 a number of therapeutically useful molecules are known in the art which 

comprise antigen-binding sites that are capable of exhibiting immunological binding 
properties of an antibody molecule. The proteolytic enzyme papain preferentially 
cleaves IgG molecules to yield several fragments, two of which (the "F(ab)" fragments) 
each comprise a covalent heterodimer that includes an intact antigen-binding site. The 
25 enzyme pepsin is able to cleave IgG molecules to provide several fragments, including 
the "flabfe " fragment which comprises both antigen-binding sites. An "Fv" fragment 
can be produced by preferential proteolytic cleavage of an IgM, and on rare occasions 
IgG or IgA immunoglobulin molecule. Fv fragments are, however, more commonly 
derived using recombinant techniques known in the art. The Fv fragment includes a 
30 non-covalent V H ::V L heterodimer including an antigen-binding site which retains much 
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of the antigen recognition and binding capabilities of the native antibody molecule. 
Inbar et al. (1972) Proc. Nat Acad. Sci. USA 69:2659-2662; Hochman et al. (1976) 
Biochem 15:2706-2710; and Ehrlich et al. (1980) Biochem 19:4091-4096. 

A single chain Fv ("sFv") polypeptide is a covalently linked V H ::V L 
5 heterodimer which is expressed from a gene fusion including V H - and V L -encoding 
genes linked by a peptide-encoding linker. Huston et al. (1988) Proc. Nat. Acad. Sci. 
USA 85(16):5879-5883. A number of methods have been described to discern chemical 
structures for converting the naturally aggregated-but chemically separated-light and 
heavy polypeptide chains from an antibody V region into an sFv molecule which will 

10 fold into a three dimensional structure substantially similar to the structure of an 
antigen-binding site. See, e.g., U.S. Pat. Nos. 5,091,513 and 5,132,405, to Huston et al.; 
and U.S. Pat No. 4,946,778, to Ladner et al. 

Each of the above-described molecules includes a heavy chain and a 
light chain CDR set, respectively interposed between a heavy chain and a light chain FR 

15 set which provide support to the CDRS and define the spatial relationship of the CDRs 
relative to each other. As used herein, the term "CDR set" refers to the three 
hypervariable regions of a heavy or light chain V region. Proceeding from the N- 
terminus of a heavy or light chain, these regions are denoted as "CDR!," "CDR^" and 
"CDR3" respectively. An antigen-binding site, therefore, includes six CDRs, 

20 comprising the CDR set from each of a heavy 'and a light chain V regioa A polypeptide 
comprising a single CDR, (e.g. 9 a CDR1, CDR2 or CDR3) is referred to herein as a 
"molecular recognition unit." Crystallographic analysis of a number of antigen-antibody 
complexes has demonstrated that the amino acid residues of CDRs form extensive 
contact with bound antigen, wherein the most extensive antigen contact is with the 

25 heavy chain CDR3. Thus, the molecular recognition units are primarily responsible for 
the specificity of an antigen-binding site. 

As used herein, the term n FR set" refers to the four flanking amino acid 
sequences which frame the CDRs of a CDR set of a heavy, or light chain V region. 
Some FR residues may contact bound antigen; however, FRs are primarily responsible 

30 for folding the V region into the antigen-binding site, particularly the FR residues 
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directly adjacent to the CDRS. Within FRs, certain amino residues and certain structural 
features are very highly conserved. In this regard, all V region sequences contain an 
internal disulfide loop of around 90 amino acid residues. When the V regions fold into a 
binding-site, the CDRs are displayed as projecting loop motifs which form an antigen- 
5 binding surface. It is generally recognized that there are conserved structural regions of 
FRs which influence the folded shape of the CDR loops into certain "canonical- 
structures-regardless of the precise CDR amino acid sequence. Further, certain FR 
residues are known to participate in non-covalent interdomain contacts which stabilize 
the interaction of the antibody heavy and fight chains. 
10 A num ber of "humanized" antibody molecules comprising an antigen- 

binding site derived from anon-human immunoglobulin have been described, including 
chimeric antibodies having rodent V regions and their associated CDRs fused to human 
constant domains (Winter et al. (1991) Nature 349:293-299; lx>buglio et al. (1989) 
Proc. Nat. Acad. Sci. USA 86:4220-4224; Shaw et al. (1987) J Immunol. 138:4534- 
15 4538; and Brown et al. (1987) Cancer Res. 47:3577-3583), rodent CDRs grafted into a 
human supporting FR prior to fusion with an appropriate human antibody constant 
domain (Riechmann et al. (1988) Nature 332:323-327; Verhoeyen et al. (1988) Science 
239:1534-1536; and Jones et al. (1986) Nature 321:522-525), and rodent CDRs 
supported by recombinantly veneered rodent FRs (European Patent Publication No. 
20 519,596, published Dec. 23, 1992). These humanized" molecules are designed to 
minimize unwanted immunological response toward rodent antihuman antibody 
molecules which limits the duration and effectiveness of therapeutic applications of 
those moieties in human recipients. 

As used herein, the terms "veneered FRs" and "recombinantly veneered 
25 FRs" refer to the selective replacement of FR residues from, e.g., a rodent heavy or light 
chain V region, with human FR residues in order to provide a xenogeneic molecule 
comprising an antigen-binding site which retains substantially all of the native FR 
polypeptide folding structure. Veneering techniques are based,on the understanding that 
the ligand binding characteristics of an antigen-binding site are determined primarily by 
30 the structure and relative disposition of the heavy and light chain CDR sets within the 
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antigen-binding surface. Davies et al. (1990) Ann. Rev. Biochera. 59:439-473. Thus, 
antigen binding specificity can be preserved in a humanized antibody only wherein the 
CDR structures, their interaction with each other, and their interaction with the rest of 
the V region domains are carefully maintained. By using veneering techniques, exterior 
5 {e.g., solvent-accessible) FR residues which are readily encountered by the immune 
system are selectively replaced with human residues to provide a hybrid molecule that 
comprises either a weakly immunogenic, or substantially non-immunogenic veneered 
surface. 

The process of veneering makes use of the available sequence data for 
10 human antibody variable domains compiled by Kabat et al., in Sequences of Proteins of 
Immunological Interest, 4th ed., (U.S. Dept. of Health and Human Services, U.S. 
Government Printing Office, 1987), updates to the Kabat database, and other accessible 
U.S. and foreign databases (both nucleic acid and protein). Solvent accessibilities of V 
region amino acids can be deduced from the known three-dimensional structure for 

15 human and murine antibody fragments. There are two general steps in veneering a 
murine antigen-binding site. Initially, the FRs of the variable domains of an antibody 
molecule of interest are compared with corresponding FR sequences of human variable 
domains obtained from the above-identified sources. The most homologous human V 
regions are then compared residue by residue to corresponding murine amino acids. The 

20 residues in the murine FR which differ from the human counterpart are replaced by the 
residues present in the human moiety using recombinant techniques well known in the 
art. Residue switching is only carried out with moieties which are at least partially 
exposed (solvent accessible), and care is exercised in the replacement of amino acid 
residues which may have a significant effect on the tertiary structure of V region 

25 domains, such as proline, glycine and charged amino acids. 

In this manner, the resultant "veneered" murine antigen-binding sites are 
thus designed to retain the murine CDR residues, the residues substantially adjacent to 
the CDRs, the residues identified as buried or mostly buried (solvent inaccessible), the 
residues believed to participate in non-covalent (e.g., electrostatic and hydrophobic) 

30 contacts between heavy and light chain domains, and the residues from conserved 
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structural regions of the FRs which are believed to influence the "canonical" tertiary 
structures of the CDR loops. These design criteria are then used to prepare recombinant 
nucleotide sequences which combine the CDRs of both the heavy and light chain of a 
murine antigen-binding site into human-appearing FRs that can be used to transfect 
5 mammalian cells for the expression of recombinant human antibodies which exhibit the 
antigen specificity of the murine antibody molecule. 

In another embodiment of the invention, monoclonal antibodies of the 
present invention may be coupled to one or more therapeutic agents. Suitable agents in 
this regard include radionuclides, differentiation inducers, drugs, toxins, and derivatives 
10 thereof. Preferred radionuclides include 90 Y, 123 I, 125 I, !31 I, ,86 Re, ,88 Re, 2n At, and 
212 Bi. Preferred drugs include methotrexate, and pyrimidine and purine analogs. 
Preferred differentiation inducers include phorbol esters and butyric acid. Preferred 
toxins include ricin, abrin, diptheria toxin, cholera toxin, gelonin, Pseudomonas 
exotoxin, Shigella toxin, and pokeweed antiviral protein. 
15 A therapeutic agent may be coupled (e.g., covalently bonded) to a 

suitable monoclonal antibody either directly or indirectly (e.g., via a linker group). A 
direct reaction between an agent and an antibody is possible when each possesses a 
substituent capable of reacting with the other. For example, a nucleophilic group, such 
as an amino or sulfhydiyl group, on one may be capable of reacting with a carbonyl- 
20 containing group, such as an anhydride or 'an acid halide, or with an alkyl group 
containing a good leaving group (e.g., a halide) on the other. 

Alternatively, it may be desirable to couple a therapeutic agent and an 
antibody via a linker group. A linker group can function as a spacer to distance an 
antibody from an agent in order to avoid interference with binding capabilities. A linker 
25 group can also serve to increase the chemical reactivity of a substituent on an agent or 
an antibody, and thus increase the coupling efficiency. An increase in chemical 
reactivity may also facilitate the use of agents, or functional groups on agents, which 
otherwise would not be possible. i 

It will be evident to those skilled in the art that a variety of Afunctional 
30 or polyfunctional reagents, both homo- and hetero-functional (such as those described in 
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the catalog of the Pierce Chemical Co., Rockford, IL), may be employed as the linker 
group. Coupling may be effected, for example, through amino groups, carboxyl groups, 
sulfhydryl groups or oxidized carbohydrate residues. There are numerous references 
describing such methodology, e.g., U.S. Patent No. 4,671,958, to Rodwell et al. 
5 Where a therapeutic agent is more potent when free from the antibody 

portion of the immunoconjugates of the present invention, it may be desirable to use a 
linker group which is cleavable during or upon internalization into a cell. A number of 
different cleavable linker groups have been described. The mechanisms for the 
intracellular release of an agent from these linker groups include cleavage by reduction 
10 of a disulfide bond (e.g., U.S. Patent No. 4,489,710, to Spider), by irradiation of a 
photolabile bond (e.g., U.S. Patent No. 4,625,014, to Senter etal.), by hydrolysis of 
derivatized amino acid side chains (e.g., U.S. Patent No. 4,638,045, to Kohn et al.), by 
serum complement-mediated hydrolysis (e.g, U.S. Patent No. 4,671,958, to Rodwell 
et al.), and acid-catalyzed hydrolysis (e.g., U.S. Patent No. 4,569,789, to Blattler et al.). 
15 K may be desirable to couple more than one agent to an antibody. In one 

embodiment, multiple molecules of an agent are coupled to one antibody molecule. In 
another embodiment, more than one type of agent may be coupled to one antibody. 
Regardless of the particular embodiment, immunoconjugates with more than one agent 
may be prepared in a variety of ways. For example, more than one agent may be 
20 coupled directiy to an antibody molecule, Or linkers that provide multiple sites for 
attachment can be used. Alternatively, a carrier can be used. 

A carrier may bear the agents in a variety of ways, including covalent 
bonding either directly or via a linker group. Suitable carriers include proteins such as 
albumins (e.g., U.S. Patent No. 4,507,234, to Kato et al.), peptides and polysaccharides 
25 such as aminodextran (e.g, U.S. Patent No. 4,699,784, to Shih et al.). A earner may 
also bear an agent by noncovalent bonding or by encapsulation, such as within a 
liposome vesicle (e.g., U.S. Patent Nos. 4,429,008 and 4,873,088). Carriers specific for 
radionuclide agents include radiohalogenated small molecules and chelating 
compounds. For example, U.S. Patent No. 4,735,792 discloses representative 
30 radiohalogenated small molecules and their synthesis. A radionuclide chelate may be 
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formed from chelating compounds that include those containing nitrogen and sulfur 
atoms as the donor atoms for binding the metal, or metal oxide, radionuclide. For 
example, U.S. Patent No. 4,673,562, to Davison et al. discloses representative chelating 
compounds and their synthesis. 



5 T Cell Compositions 

The present invention, in another aspect, provides T cells specific for a 
tumor polypeptide disclosed herein, or for a variant or derivative thereof. Such cells 
may generally be prepared in vitro or ex vivo, using standard procedures. For example, 
T cells may be isolated from bone marrow, peripheral blood, or a fraction of bone 
10 marrow or peripheral blood of a patient, using a commercially available cell separation 
system, such as the Isolex™ System, available from Nexell Therapeutics, Inc. (Irvine, 
CA; see also U.S. Patent No. 5,240,856; U.S. PatentNo. 5,215,926; WO 89/06280; WO 
91/16116 and WO 92/07243). Alternatively, T cells may be derived from related or 
unrelated humans, non-human mammals, cell lines or cultures, 
j 5 x ce n s may be stimulated with a polypeptide, polynucleotide encoding a 

polypeptide and/or an antigen presenting cell (APQ that expresses such a polypeptide. 
Such stimulation is performed under conditions and for a time sufficient to permit the 
generation of T cells that are specific for the polypeptide of interest. Preferably, atumor 
polypeptide or polynucleotide of the invention is present within a delivery vehicle, such 
20 as a microsphere, to facilitate the generation of specific T cells. 

T cells are considered to be specific for a polypeptide of the present 
invention if the T cells specifically proliferate, secrete cytokines or kill target cells 
coated with the polypeptide or expressing a gene encoding the polypeptide. T cell 
specificity may be evaluated using any of a variety of standard techniques. For 
25 example, within a chromium release assay or proliferation assay, a stimulation index of 
more than two fold increase in lysis and/or proliferation, compared to negative controls, 
indicates T cell specificity. Such assays may be performed, for example, as described in 
Chen et al., Cancer Res. 54:1065-1070, 1994. Alternatively, detection of the 
proliferation of T cells may be accomplished by a variety of known techniques. For 
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example, T cell proliferation can be detected by measuring an increased rate of DNA 
synthesis (e.g., by pulse-labeling cultures of T cells with tritiated thymidine and 
measuring the amount of tritiated thymidine incorporated into DNA). Contact with a 
tumor polypeptide (100 ng/ml - 100 ng/ml, preferably 200 ng/ml - 25 jig/ml) for 3 - 7 
5 days will typically result in at least a two fold increase in proliferation of the T cells. 
Contact as described above for 2-3 hours should result in activation of the T cells, as 
measured using standard cytokine assays in which a two fold increase in the level of 
cytokine release (e.g., TNF or IFN-y) is indicative of T cell activation (see Coligan et 
al., Current Protocols in Immunology, vol. 1, Wiley Interscience (Greene 1998)). T 

10 cells that have been activated in response to a tumor polypeptide, polynucleotide or 
polypeptide-expressing APC may be CD4 + and/or CD8 + . Tumor polypeptide-specific T 
cells may be expanded using standard techniques. Within preferred embodiments, the T 
cells are derived from a patient, a related donor or an unrelated donor, and are 
administered to the patient following stimulation and expansion. 

15 For therapeutic purposes, CD4+ or CD8 + T cells that proliferate in 

response to a tumor polypeptide, polynucleotide or APC can be expanded in number 
either in vitro or in vivo. Proliferation of such T cells in vitro may be accomplished in a 
variety of ways. For example, the T cells can be re-exposed to a tumor polypeptide, or a 
short peptide corresponding to an immunogenic portion of such a polypeptide, with or 

20 without the addition of T cell growth factors; such as interleukin-2, and/or stimulator 
cells that synthesize a tumor polypeptide. Alternatively, one or more T cells that 
proliferate in the presence of the tumor polypeptide can be expanded in number by 
cloning. Methods for cloning cells are well known in the art, and include limiting 
dilution. 

25 

T Cell Receptor Compositions 

The T cell receptor (TCR) consists of 2 different, highly variable 
polypeptide chains, termed the T-cell receptor a and (5 chains, that are linked by a 
disulfide bond (Janeway, Travers, Walport. Imrnunobiology. Fourth Ed., 148-159. 
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Elsevier Science Ltd/Garland Publishing. 1999). The o/p heterodimer complexes with 
the invariant CD3 chains at the cell membrane. This complex recognizes specific 
antigenic peptides bound to MHC molecules. The enormous diversity of TCR 
specificities is generated much like immunoglobulin diversity, through somatic gene 
5 rearrangement. The p chain genes contain over 50 variable (V), 2 diversity (D), over 10 
joining (J) segments, and 2 constant region segments (C). The a chain genes contain 
over 70 V segments, and over 60 J segments but no D segments, as well as one C 
segment. During T cell development in the thymus, the D to J gene rearrangement of 
the p chain occurs, followed by the V gene segment rearrangement to the D J. This 
10 functional VDJp exon is transcribed and spliced to join to a Cp. For the a chain, a V a 
gene segment rearranges to a J a gene segment to create the functional exon that is then 
transcribed and spliced to the C a . Diversity is further increased during the 
recombination process by the random addition of P and N-nucleotides between the V, 
D, and J segments of the p chain and between the V and J segments in the a chain 
15 (Janeway, Travers, Walport Immunobiology. Fourth Ed., 98 and 150. Elsevier Science 
Ltd/Garland Publishing. 1999). 

The present invention, in another aspect, provides TCRs specific for a 
polypeptide disclosed herein, or for a variant or derivative thereof. In accordance with 
the present invention, polynucleotide and amino acid sequences are provided for the V-J 
20 or V-D-J junctional regions or parts thereof for the alpha and beta chains of the T-cell 
receptor which recognize tumor polypeptides described herein. In general, this aspect 
of the invention relates to T-cell receptors which recognize or bind tumor polypeptides 
presented in the context of MHC. In a preferred embodiment the tumor antigens 
recognized by the T-cell receptors comprise a polypeptide of the present invention. For 
25 example, cDNA encoding a TCR specific for a Jiimor peptide can be isolated from T 
cells specific for a tumor polypeptide using standard molecular biological and 
recombinant DNA techniques. 

This invention further includes the T-cell receptors or analogs thereof 
having substantially the same function or activity as the T-cell receptors of this 
30 invention which recognize or bind tumor polypeptides. Such receptors include, but are 
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not limited to, a fragment of the receptor, or a substitution, addition or deletion mutant 
of a T-cell receptor provided herein. This invention also encompasses polypeptides or 
peptides that are substantially homologous to the T-cell receptors provided herein or 
that retain substantially the same activity. The term "analog" includes any protein or 
5 polypeptide having an amino acid residue sequence substantially identical to the T-cell 
receptors provided herein in which one or more residues, preferably no more than 5 
residues, more preferably no more than 25 residues have been conservatively substituted 
with a functionally similar residue and which displays the functional aspects of the T- 
cell receptor as described herein. 

10 The present invention further provides for suitable mammalian host 

cells, for example, non-specific T cells, that are transfected with a polynucleotide 
encoding TCRs specific for a polypeptide described herein, thereby rendering the host 
cell specific for the polypeptide. The a and p chains of the TCR may be contained on 
separate expression vectors or alternatively, on a single expression vector that also 

15 contains an internal ribosome entry site (IRES) for cap-independent translation of the 
gene downstream of the IRES. Said host cells expressing TCRs specific for the 
polypeptide may be used, for example, for adoptive immunotherapy of lung cancer as 
discussed further below. 

In further aspects of the present invention, cloned TCRs specific for a 
20 polypeptide recited herein may be used in a kit for the diagnosis of lung cancer. For 
example, the nucleic acid sequence or portions thereof, of tumor-specific TCRs can be 
used as probes or primers for the detection of expression of the rearranged genes 
encoding the specific TCR in a biological sample. Therefore, the present invention 
further provides for an assay for detecting messenger RNA or DNA encoding the TCR 
25 specific for a polypeptide. 

Pharmaceutical Compositions 

In additional embodiments, the present invention concerns formulation 
of one or more of the polynucleotide, polypeptide, T-cell and/or antibody compositions 



80 



WO 02/00174 



PCT/US01/21065 



disclosed, herein in pharmaceutically-acceptable carriers for administration to a cell or 
an animal, either alone, or in combination with one or more other modalities of therapy. 

It will be understood that, if desired, a composition as disclosed herein 
may be administered in combination with other agents as well, such as, e.g., other 
5 proteins or polypeptides or various pharmaceutically-active agents. In fact, there is 
virtually no limit to other components that may also be included, given that the 
additional agents do not cause a significant adverse effect upon contact with the target 
cells or host tissues. The compositions may thus be delivered along with various other 
agents as required in the particular instance. Such compositions may be purified from 
10 host cells or other biological sources, or alternatively may be chemically synthesized as 
described herein. likewise, such compositions may further comprise substituted or 
derivatized RNA or DNA compositions. 

Therefore, in another aspect of the present invention, pharmaceutical 
compositions are provided comprising one or more of the polynucleotide, polypeptide, 
15 antibody, and/or T-cell compositions described herein in combination with a 
physiologically acceptable carrier. In certain preferred embodiments, the 
pharmaceutical compositions of the invention comprise immunogenic polynucleotide 
and/or polypeptide compositions of the invention for use in prophylactic and theraputic 
vaccine applications. Vaccine preparation is generally described in, for example, M.F. 
20 Powell and M J. Newman, eds., "Vaccine Design (the subunit and adjuvant approach)," 
Plenum Press (NY, 1995). Generally, such compositions will comprise one or more 
polynucleotide and/or polypeptide compositions of the present invention in combination 
with one or more immunostimulants. 

It will be apparent that any of the pharmaceutical compositions described 
25 herein can contain pharmaceutical^ acceptable salts of the polynucleotides and 
polypeptides of the invention. Such salts can be prepared, for example, from 
pharmaceutical^ acceptable non-toxic bases, including organic bases (e.g., salts of 
primary, secondary and tertiary amines and basic amino acids) and inorganic bases {e.g., 
sodium, potassium, lithium, ammonium, calcium and magnesium salts). 
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/ In another embodiment, illustrative immunogenic compositions, e.g., 
vaccine compositions, of the present invention comprise DNA encoding one or more of 
the polypeptides as described above, such that the polypeptide is generated in situ. As 
noted above, the polynucleotide may be administered within any of a variety of delivery 
5 systems known to those of ordinary skill in the art Indeed, numerous gene delivery 
techniques are well known in the art, such as those described by Rolland, Crit. Rev. 
Therap. Drug Carrier Systems 75:143-198, 1998, and references cited therein. 
Appropriate polynucleotide expression systems will, of course, contain the necessary 
regulatory DNA regulatory sequences for expression in a patient (such as a suitable 
1 0 promoter and terminating signal). Alternatively, bacterial delivery systems may involve 
the administration of a bacterium (such as Bacillus-Calmette-Guerriri) that expresses an 
immunogenic portion of the polypeptide on its cell surface or secretes such an epitope. 

Therefore, in certain embodiments, polynucleotides encoding 
immunogenic polypeptides described herein are introduced into suitable mammalian 

15 host cells for expression using any of a number of known viral-based systems. In one 
illustrative embodiment, retroviruses provide a convenient and effective platform for 
gene delivery systems. A selected nucleotide sequence encoding a polypeptide of the 
present invention can be inserted into a vector and packaged in retroviral particles using 
techniques known in the art. The recombinant virus can then be isolated and delivered 

20 to a subject. A number of illustrative retroviral systems have been described (e.g., UJS. 
Pat. No. 5,219,740; Miller and Rosman (1989) BioTechniques 7:980-990; Miller, A. D. 
(1990) Human Gene Therapy 1:5-14; Scarpa et al. (1991) Virology 180:849-852; Burns 
et al. (1993) Proc. Natl. Acad. Sci. USA 90:8033-8037; and Boris-Lawrie and Temin 
(1993) Cur. Opin. Genet. Develop. 3:102-109. 

25 In addition, a number of illustrative adenovirus-based systems have also 

been described. Unlike retroviruses which integrate into the host genome, adenoviruses 
persist extrachromosomally thus minimizing the risks associated with insertional 
mutagenesis (Haj-Ahmad and Graham (1986) J. Virol. 57:267 r 274; Bett et al. (1993) J. 
Virol. 67:591 1-5921; Mittereder et al. (1994) Human Gene Therapy 5:717-729; Seth et 

30 al. (1994) J. Virol. 68:933-940; Barr et al. (1994) Gene Therapy 1:51-58; Berkner, K. L. 
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(1988) BjoTecbniques 6:616-629; and Rich et al. (1993) Human Gene Therapy 4:461- 
476). 

Various adeno-associated virus (AAV) vector systems have also been 
developed for polynucleotide delivery. AAV vectors can be readily constructed using 
5 techniques well known in the art. See, e.g., U.S. Pat. Nos. 5,173,414 and 5,139,941; 
International Publication Nos. WO 92/01070 and WO 93/03769; Lebkowski et al. 
(1988) Molec. Cell. Biol. 8:3988-3996; Vincent et al. (1990) Vaccines 90 (Cold Spring 
Harbor Laboratory Press); Carter, B. J. (1992) Current Opinion in Biotechnology 3:533- 
539; Muzyczka, N. (1992) Current Topics in Microbiol, and Immunol. 158:97-129; 

10 Kotin, R. M. (1994) Human Gene Therapy 5:793-801; Shelling and Smith (1994) Gene 
Therapy 1:165-169; and Zhou et al. (1994) J. Exp. Med. 179:1867-1875. 

Additional viral vectors useful for delivering the polynucleotides 
encoding polypeptides of the present invention by gene transfer include those derived 
from the pox family of viruses, such as vaccinia virus and avian poxvirus. By way of 

15 example, vaccinia virus recombinants expressing the novel molecules can be 
constructed as follows. The DNA encoding a polypeptide is first inserted into an 
appropriate vector so that it is adjacent to a vaccinia promoter and flanking vaccinia 
DNA sequences, such as the sequence encoding thymidine kinase (TK). This vector is 
then used to transfect cells which are simultaneously infected with vaccinia. 

20 Homologous recombination serves to insert the vaccinia promoter plus the gene 
encoding the polypeptide of interest into the viral genome. The resulting TK.sup.(-) 
recombinant can be selected by culturing the cells in the presence of 5- 
bromodeoxyuridine and picking viral plaques resistant thereto. 

A vaccinia-based infection/transfection system can be conveniently used 

25 to provide for inducible, transient expression or coexpression of one or more 
polypeptides described herein in host cells of an organism. In this particular system, 
cells are first infected in vitro with a vaccinia virus recombinant that encodes the 
bacteriophage T7 RNA polymerase. This polymerase displays exquisite specificity in 
that it only transcribes templates bearing T7 promoters. Following infection, cells are 

30 transfected with the polynucleotide or polynucleotides of interest, driven by a T7 
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promoter. The polymerase expressed in the cytoplasm from the vaccinia virus 
recombinant transcribes the transfected DNA into RNA which is then translated into 
polypeptide by the host translation^ machinery. The method provides for high level, 
transient, cytoplasmic production of large quantities of RNA and its translation 
5 products. See, e.g., Elroy-Stein and Moss, Proc. Natl. Acad. Sci. USA (1990) 87:6743- 
6747; Fuerst et al. Proc. Natl. Acad. Sci. USA (1986) 83:8122-8126. 

Alternatively, avipoxviruses, such as the fowlpox and canarypox viruses, 
can also be used to deliver the coding sequences of interest. Recombinant avipox 
viruses, expressing immunogens from mammalian pathogens, are known to confer 

10 protective immunity when administered to non-avian species. The use of an Avipox 
vector is particularly desirable in human and other mammalian species since members 
of the Avipox genus can only productively replicate in susceptible avian species and 
therefore are not infective in mammalian cells. Methods for producing recombinant 
Avipoxviruses are known in the art and employ genetic recombination, as described 

15 above with respect to the production of vaccinia viruses. See, e.g., WO 91/12882; WO 
89/03429; and WO 92/03545. 

Any of a number of alphavirus vectors can also be used for delivery of 
polynucleotide compositions of the present invention, such as those vectors described in 
U.S. Patent Nos. 5,843,723; 6,015,686; 6,008,035 and 6,015,694. Certain vectors based 

20 on Venezuelan Equine Encephalitis (VEE) dan also be used, illustrative examples of 
which can be found in U.S. Patent Nos. 5,505,947 and 5,643,576. 

Moreover, molecular conjugate vectors, such as the adenovirus chimeric 
vectors described in Michael et al. J. Biol. Chem. (1993) 268:6866-6869 and Wagner et 
al. Proc. Natl. Acad. Sci. USA (1992) 89:6099-6103, can also be used for gene delivery 

25 under the invention. 

Additional illustrative information on these and other known viral-based 
delivery systems can be found, for example, in Fisher-Hoch et al., Proc. Natl Acad Sci 
USA 56:317-321, 1989; Flexner et al., Arm. N.Y. Acad Sci. 5^:86-103, 1989; Flexner 
et al, Vaccine 5:17-21, 1990; U.S. Patent Nos. 4,603,112, 4,769,330, and 5,017,487; 

30 WO 89/01973; U.S. Patent No. 4,777,127; GB 2,200,65 1; EP 0,345,242; 
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WO 91/02805; Berkner, Biotechniques 5:616-627, 1988; Rosenfeld et aL, Science 
252:431-434, 1991; Kolls et al., Proc. Natl Acad. Set USA Pi:215-219, 1994; 
Kass-Eisler et al., Proc. Natl Acad. Set USA 90:1 1498-11502, 1993; Guzman et aL, 
Circulation 55:2838-2848, 1993; and Guzman et al., Cir. Res. 75:1202-1207, 1993. 
5 In certain embodiments, a polynucleotide may be integrated into the 

genome of a target cell. This integration may be in the specific location and orientation 
via homologous recombination (gene replacement) or it may be integrated in a random, 
non-specific location (gene augmentation). In yet further embodiments, the 
polynucleotide may be stably maintained in the cell as a separate, episomal segment of 
10 DNA. Such polynucleotide segments or "episomes" encode sequences sufficient to 
permit maintenance and replication independent of or in synchronization with the host 
cell cycle- The manner in which the expression construct is delivered to a cell and 
where in the cell the polynucleotide remains is dependent on the type of expression 
construct employed. 

15 In another embodiment of the invention, a polynucleotide is 

administered/delivered as "naked" DNA, for example as described in Ulmer et al., 
Science 2JP:1745-1749, 1993 and reviewed by Cohen, Science 259: 169 1-1 692, 1993. 
The uptake of naked DNA may be increased by coating the DNA onto biodegradable 
beads, which are efficiently transported into the cells. 

20 In still another embodiment, a' composition of the present invention can 

be delivered via a particle bombardment approach, many of which have been described. 
In one illustrative example, gas-driven particle acceleration can be achieved with 
devices such as those manufactured by Powderject Pharmaceuticals PLC (Oxford, UK) 
and Powderject Vaccines Inc. (Madison, Wl), some examples of which are described in 

25 U.S. Patent Nos. 5,846,796; 6,010,478; 5,865,796; 5,584,807; and EP Patent No. 0500 
799. This approach offers a needle-free delivery ^proach wherein a dry powder 
formulation of microscopic particles, such as polynucleotide or polypeptide particles, 
are accelerated to high speed within a helium gas jet generated by a hand held device, 
propelling the particles into a target tissue of interest. 
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/ In a related embodiment, other devices and methods that may be useful 
for gas-driven needle-less injection of compositions of the present invention include 
those provided by Bioject, Inc. (Portland, OR), some examples of which are described 
in U.S. Patent Nos. 4,790,824; 5,064,413; 5,312,335; 5,383,851; 5399,163; 5,520,639 
5 and 5,993,412. 

According to another embodiment, the pharmaceutical compositions 
described herein will comprise one or more immunostimulants in addition to the 
immunogenic polynucleotide, polypeptide, antibody, T-cell and/or APC compositions 
of this invention. An immunostimulant refers to essentially any substance that enhances 

10 or potentiates an immune response (antibody and/or cell-mediated) to an exogenous 
antigen. One preferred type of immunostimulant comprises an adjuvant. Many 
adjuvants contain a substance designed to protect the antigen from rapid catabolism, 
such as aluminum hydroxide or mineral oil, and a stimulator of immune responses, such 
as lipid A, Bortadella pertussis or Mycobacterium tuberculosis derived proteins. 

15 Certain adjuvants are commercially available as, for example, Freund's Incomplete 
Adjuvant and Complete Adjuvant (Difco Laboratories, Detroit, MI); Merck Adjuvant 
65 (Merck and Company, Inc., Rahway, NJ); AS-2 (SmithKline Beecham, Philadelphia, 
PA); aluminum salts such as aluminum hydroxide gel (alum) or aluminum phosphate; 
salts of calcium, iron or zinc; an insoluble suspension of acylated tyrosine; acylated 

20 sugars; cationically or anionically derivatifced polysaccharides; polyphosphazenes; 
biodegradable microspheres; monophosphoryl lipid A and quil A. Cytokines, such as 
GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be used as 
adjuvants. 

Within certain, embodiments of the invention, the adjuvant composition 
25 is preferably one that induces an immune response predominantly of the Thl type. High 
levels of Thl-type cytokines (e.g., IFN-y, TNFo, IL-2 and IL-12) tend to favor the 
induction of cell mediated immune responses to an administered antigen. In contrast, 
high levels of Th2-type cytokines (e.g., H^4, DL-5, IL-6 and, DL-10) tend to favor the 
induction of humoral immune responses. Following application of a vaccine as 
30 provided herein, a patient will support an immune response that includes Thl- and Th2- 
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type resppnses. Within a preferred embodiment, in which a response is predominantly 
Thl-type, the level of Thl-type cytokines will increase to a greater extent than the level 
of Th2-type cytokines. The levels of these cytokines may be readily assessed using 
standard assays. For a review of the families of cytokines, see Mosmann and Coffman, 

5 Ann. Rev. Immunol. 7:145-173, 1989. 

Certain preferred adjuvants for eliciting a predominantly Thl-type 
response include, for example, a combination of monophosphoryl lipid A, preferably 3- 
de-O-acylated monophosphoryl lipid A, together with an aluminum salt. MPL® 
adjuvants are available from Corixa Corporation (Seattle, WA; see, for example, US 

10 Patent Nos. 4,436,727; 4,877,611; 4,866,034 and 4,912,094). CpG-containing 
oligonucleotides (in which the CpG dinucleotide is unmethylated) also induce a 
predominantly Thl response. Such oligonucleotides are well known and are described, 
for example, in WO 96/02555, WO 99/33488 and U.S. Patent Nos. 6,008,200 and 
5,856,462. Immunostimulatory DNA sequences are also described, for example, by 

15 Sato et al., Science 273:352, 1996. Another preferred adjuvant comprises a saponin, 
such as Quil A, or derivatives thereof, including QS21 and QS7 (Aquila 
Biopharmaceuticals Inc., Framingham, MA); Escin; Dighonin; or Gypsophila or 
Chenopodium quinoa saponins . Other preferred formulations include more than one 
saponin in the adjuvant combinations of the present invention, for example 

20 combinations of at least two of the following 'group comprising QS21, QS7, Quil A, p- 

escin, or digitonin. 

Alternatively the saponin formulations may be combined with vaccine 
vehicles composed of chitosan or other polycationic polymers, polylactide and 
polylactide-co-glycolide particles, poly-N-acetyl glucosamine-based polymer matrix, 

25 particles composed of polysaccharides or chemically modified polysaccharides, 
liposomes and lipid-based particles, particles composed of glycerol monoesters, etc. The 
saponins may also be formulated in the presence of cholesterol to form particulate 
structures such as liposomes or ISCOMs. Furthermore, the saponins may be formulated 
together with a polyoxyethylene ether or ester, in either a non-particulate solution or 

30 suspension, or in a particulate structure such as a paucilamelar liposome or ISCOM. The 
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saponins may also be formulated with excipients such as Carbopol R to increase 
viscosity, or may be formulated in a dry powder form with a powder excipient such as 
lactose. 

In one preferred embodiment, the adjuvant system includes the 
combination of a monophosphoryl lipid A and a saponin derivative, such as the 
combination of QS21 and 3D-MPL® adjuvant, as described in WO 94/00153, or a less 
reactogenic composition where the QS21 is quenched with cholesterol, as described in 
WO 96/33739. Other preferred formulations comprise an oil-in-water emulsion and 
tocopherol. Another particularly preferred adjuvant formulation employing QS21, 3D- 
MPL® adjuvant and tocopherol in an oil-in-water emulsion is described in WO 
95/17210. 

Another enhanced adjuvant system involves the combination of a CpG- 
containing oligonucleotide and a saponin derivative particularly the combination of 
CpG and QS21 is disclosed in WO 00/09159. Preferably the formulation additionally 
comprises an oil in water emulsion and tocopherol. 

Additional illustrative adjuvants for use in the pharmaceutical 
compositions of the invention include Montanide ISA 720 (Seppic, France), SAF 
(Chiron, California, United States), ISCOMS (CSL), MF-59 (Chiron), the SBAS series 
of adjuvants (e.g., SBAS-2 or SBAS-4, available from SmithKline Beecham, Rixensart, 
Belgium), Detox (Enhanzyn*) (Corixa, Hamilton, MT), RC-529 (Corixa, Hamilton, 
MT) and other aminoalkyl glucosaminide 4-phosphates (AGPs), such as those described 
in pending U.S. Patent Application Serial Nos. 08/853,826 and 09/074,720, the 
disclosures of which are incorporated herein by reference in their entireties, and 
polyoxyethylene ether adjuvants such as those described in WO 99/52549A1. 

Other preferred adjuvants include adjuvant molecules of the general 

formula 

(I): HO(CH 2 CH 2 0)n-A-R, 
wherein, n is 1-50, A is a bond or -C(0>, R is C,- 5 o alkyl or Phenyl Ci. 50 alkyl. 

One embodiment of the present invention consists of a vaccine 
formulation comprising a polyoxyethylene ether of general formula (I), wherein n is 



88 



WO 02/00174 



PCT/US01/21065 



between 1 and 50, preferably 4-24, most preferably 9; the R component is Ci. 50 , 
preferably C 4 -C 20 alkyl and most preferably C )2 alkyl, and A is a bond. The 
concentration of the polyoxyethylene ethers should be in the range 0.1-20%, preferably 
from 0.1-10%, and most preferably in the range 0.1-1%. Preferred polyoxyethylene 

5 ethers are selected from the following group: polyoxyethylene-9-lauryl ether, 
polyoxyethylene-9-steoryl ether, polyoxyethylene-8-steoryl ether, pory° x y eth y lene - 4 - 
lauryl ether, polyoxyethylene-35-lauryl ether, and polyoxyethylene-23-lauryl ether. 
Polyoxyethylene ethers such as polyoxyethylene lauryl ether are described in the Merck 
index (12 th edition: entry 7717). These adjuvant molecules are described in WO 

10 99/52549. 

The polyoxyethylene ether according to the general formula (I) above 
may, if desired, be combined with another adjuvant. For example, a preferred adjuvant 
combination is preferably with CpG as described in the pending UK patent application 
GB 9820956.2. 

15 According to another embodiment of this invention, an immunogenic 

composition described herein is delivered to a host via antigen presenting cells (APCs), 
such as dendritic cells, macrophages, B cells, monocytes and other cells that may be 
engineered to be efficient APCs. Such cells may, but need not, be genetically modified 
to increase the capacity for presenting the antigen, to improve activation and/or 

20 maintenance of the T cell response, to have anti-tumor effects per se and/or to be 
immunologically compatible with the receiver {i.e., matched HLA haplotype). APCs 
may generally be isolated from any of a variety of biological fluids and organs, 
including tumor and peritumoral tissues, and may be autologous, allogeneic, syngeneic 
or xenogeneic cells. 

25 Certain preferred embodiments of the present invention use dendritic 

cells or progenitors thereof as antigen-presenting cells. Dendritic cells are highly potent 
APCs (Banchereau and Steinman, Nature 5P2:245-251, 1998) and have been shown to 
be effective as a physiological adjuvant for eliciting prophylactic or therapeutic 
antitumor immunity (see Timmerman and Levy, Ann. Rev. Med 50:507-529, 1999). In 

30 general, dendritic cells may be identified based on their typical shape (stellate in situ, 
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with marked cytoplasmic processes (dendrites) visible in vitro), their ability to take up, 
process and present antigens with high efficiency and their ability to activate naive T 
cell responses. Dendritic cells may, of course, be engineered to express specific cell- 
surface receptors or ligands that are not commonly found on dendritic cells in vivo or ex 
vivo, and such modified dendritic cells are contemplated by the present invention. As 
an alternative to dendritic cells, secreted vesicles antigen-loaded dendritic cells (called 
exosomes) may be used within a vaccine (see Zitvogel et aL, Nature Med 4:594-600, 
1998). 

Dendritic cells and progenitors may be obtained from peripheral blood, 
bone marrow, tumor-infiltrating cells, peri tumoral tissues-infiltrating cells, lymph 
nodes, spleen, skin, umbilical cord blood or any other suitable tissue or fluid. For 
example, dendritic cells may be differentiated ex vivo by adding a combination of 
cytokines such as GM-CSF, EL-4, DL-13 and/or TOFct to cultures of monocytes 
harvested from peripheral blood. Alternatively, CD34 positive cells harvested from 
peripheral blood, umbilical cord blood or bone marrow may be differentiated into 
dendritic cells by adding to the culture medium combinations of GM-CSF, IL-3, TNFct, 
CD40 ligand, LPS, flt3 ligand and/or other comppund(s) that induce differentiation, 
maturation and proliferation of dendritic cells. 

Dendritic cells are conveniently categorized as "immature" and "mature" 
cells, which allows a simple way to discriminate between two well characterized 
phenotypes. However, this nomenclature should not be construed to exclude all 
possible intermediate stages of differentiation. Immature dendritic cells are 
characterized as APC with a high capacity for antigen uptake and processing, which 
correlates with the high expression of Fey receptor and mannose receptor. The mature 
phenotype is typically characterized by a lower expression of these markers, but a high 
expression of cell surface molecules responsible for T cell activation such as class I and 
class II MHC, adhesion molecules (e.g., CD54 and CD1 1) and costimulatory molecules 
(e.g., CD40, CD80, CD86 and 4-1BB). 

APCs may generally be transfected with a polynucleotide of the 
invention (or portion or other variant thereof) such that the encoded polypeptide, or an 
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immunogenic portion thereof, is expressed on the cell surface. Such transfection may 
take place ex vivo, and a pharmaceutical composition comprising such transfected cells 
may then be used for therapeutic purposes, as described herein. Alternatively, a gene 
delivery vehicle that targets a dendritic or other antigen presenting cell may be 
5 administered to a patient, resulting in transfection that occurs in vivo. In vivo and ex 
vivo transfection of dendritic cells, for example, may generally be performed using any 
methods known in the art, such as those described in WO 97/24447, or the gene gun 
approach described by Mahvi et al., Immunology and cell Biology 75:456-460, 1997. 
Antigen loading of dendritic cells may be achieved by incubating dendritic cells or 
10 progenitor cells with the tumor polypeptide, DNA (naked or within a plasmid vector) or 
KNA; or with antigen-expressing recombinant bacterium or viruses (e.g., vaccinia, 
fowlpox, adenovirus or lentivirus vectors). Prior to loading, the polypeptide may be 
covalently conjugated to an immunological partner that provides T cell help (e.g., a 
carrier molecule). Alternatively, a dendritic cell may be pulsed with a non-conjugated 
1 5 immunological partner, separately or in the presence of the polypeptide. 

While any suitable carrier known to those of ordinary skill in the art may 
be employed in the pharmaceutical compositions of this invention, the type of carrier 
will typically vary depending on the mode of administration. Compositions of the 
present invention may be formulated for any appropriate manner of administration, 
20 including for example, topical, oral, nasal, mucosal, intravenous, intracranial, 
intraperitoneal, subcutaneous and intramuscular administration. 

Carriers for use within such pharmaceutical compositions are 
biocompatible, and may also be biodegradable. In certain embodiments, the 
formulation preferably provides a relatively constant level of active component release. 
25 In other embodiments, however, a more rapid rate of release immediately upon 
administration may be desired. The formulation of such compositions is well within the 
level of ordinary skill in the art using known techniques. Illustrative carriers useful in 
this regard include microparticles of poly(lactide^o-glycoUde), polyaoylate, latex, 
starch, cellulose, dextran and the like. Other illustrative delayed-release carriers 
30 include supramolecular biovectors, which comprise a non-liquid hydrophilic core (e.g., 
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a cross-linked polysaccharide or oligosaccharide) and, optionally, an external layer 
comprising an amphiphilic compound, such as a phospholipid (see e.g., U.S. Patent No. 
5,151,254 and PCT applications WO 94/20078, WO/94/23701 and WO 96/06638). The 
amount of active compound contained within a sustained release formulation depends 
5 upon the site of implantation, the rate and expected duration of release and the nature of 
the condition to be treated or prevented. 

In another illustrative embodiment, biodegradable microspheres (e.g., 
polylactate polyglycolate) are employed as carriers for the compositions of this 
invention. Suitable biodegradable microspheres are disclosed, for example, in U.S. 

10 Patent Nos. 4,897,268; 5,075,109; 5,928,647; 5,811,128; 5,820,883; 5,853,763; 
5,814^44, 5,407,609 and 5,942,252. Modified hepatitis B core protein carrier systems, 
such as described in WO/99 40934, and references cited therein, will also be useful for 
many applications. Another illustrative carrier/delivery system employs a earner 
comprising particulate-protein complexes, such as those described in U.S. Patent No. 

15 5,928,647, which are capable of inducing a class I-restricted cytotoxic T lymphocyte 
responses in a host 

The pharmaceutical compositions of the invention will often further 
comprise one or more buffers (e.g., neutral buffered saline or phosphate buffered 
saline), carbohydrates (e.g., glucose, mannose, sucrose or dextrans), mannitol, proteins, 

20 polypeptides or amino acids such as glycihe, antioxidants, bacteriostats, chelating 
agents such as EDTA or glutathione, adjuvants (e.g., aluminum hydroxide), solutes that 
render the formulation isotonic, hypotonic or weakly hypertonic with the blood of a 
recipient, suspending agents, thickening agents and/or preservatives. Alternatively, 
compositions of the present invention may be formulated as a lyophilizate. 

25 The pharmaceutical compositions described herein may be presented in 

unit-dose or multi-dose containers, such as sealed ampoules or vials. Such containers 
are typically sealed in such a way to preserve the sterility and stability of the 
formulation until use. In general, formulations may be stored as suspensions, solutions 
or emulsions in oily or aqueous vehicles. Alternatively, a pharmaceutical composition 
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may be scored in a freeze-dried condition requiring only the addition of a sterile liquid 
carrier immediately prior to use. 

The development of suitable dosing and treatment regimens for using the 
particular compositions described herein in a variety of treatment regimens, including 
5 e.g., oral, parenteral, intravenous, intranasal, and intramuscular a<lministration and 
formulation, is well known in the art, some of which are briefly discussed below for 
general purposes of illustration. 

In certain applications, the pharmaceutical compositions disclosed herein 
may be delivered via oral administration to an animal. As such, these compositions 
10 may be formulated with an inert diluent or with an assimilable edible carrier, or they 
may be enclosed in hard- or soft-shell gelatin capsule, or they may be compressed into 
tablets, or they may be incorporated directly with the food of the diet. 

The active compounds may even be incorporated with excipients and 
used in the form of ingestible tablets, buccal tables, troches, capsules, elixirs, 
15 suspensions, syrups, wafers, and the like (see, for example, Mathiowitz et al, Nature 
1997 Mar 27;386(6623):410-4; Hwang et al, Crit Rev Ther Drug Carrier Syst 
1998;15(3):243-84; U. S. Patent 5,641,515; U. S. Patent 5,580,579 and U. S. Patent 
5,792,451). Tablets, troches, pills, capsules and the like may also contain any of a 
variety of additional components, for example, a binder, such as gum tragacanth, acacia, 
20 cornstarch, or gelatin; excipients, such as dic'alcium phosphate; a disintegrating agent, 
such as com starch, potato starch, alginic acid and the like; a lubricant, such as 
magnesium stearate; and a sweetening agent, such as sucrose, lactose or saccharin may 
be added or a flavoring agent, such as peppermint, oil of wintergreen, or cherry 
flavoring. When the dosage unit form is a capsule, it may contain, in addition to 
25 materials of the above type, a liquid carrier. Various other materials may be present as 
coatings or to otherwise modify the physical form of the dosage unit For instance, 
tablets, pills, or capsules may be coated with shellac, sugar, or both. Of course, any 
material used in preparing any dosage unit form should be pharmaceuticalty pure and 
substantially non-toxic in the amounts employed. In addition, the active compounds 
30 may be incorporated into sustained-release preparation and formulations. 
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Typically, these formulations will contain at least about 0.1% of the 
active compound or more, although the percentage of the active ingredient(s) may, of 
course, be varied and may conveniently be between about 1 or 2% and about 60% or 
70% or more of the weight or volume of the total formulation. Naturally, the amount of 
5 active compound(s) in each therapeutically useful composition may be prepared is such 
a way that a suitable dosage will be obtained in any given unit dose of the compound. 
Factors such as solubility, bioavailability, biological half-life, route of administration, 
product shelf life, as well as other pharmacological considerations will be contemplated 
by one skilled in the art of preparing such pharmaceutical formulations, and as such, a 

10 variety of dosages and treatment regimens may be desirable. 

For oral administration the compositions of the present invention may 
alternatively be incorporated with one or more excipients in the form of a mouthwash, 
dentifrice, buccal tablet, oral spray, or sublingual orally-administered formulation. 
Alternatively, the active ingredient may be incorporated into an oral solution such as 

15 one containing sodium borate, glycerin and potassium bicarbonate, or dispersed in a 
dentifrice, or added in a therapeutically-effective amount to a composition that may 
include water, binders, abrasives, flavoring agents, foaming agents, and humectants. 
Alternatively the compositions may be fashioned into a tablet or solution form that may 
be placed under the tongue or otherwise dissolved in the mouth. 

20 In certain circumstances it will be desirable to deliver the pharmaceutical 

compositions disclosed herein parenterally, intravenously, intramuscularly, or even 
intraperitoneally. Such approaches are well known to the skilled artisan, some of which 
are further described, for example, in U. S. Patent 5,543,158; U. S. Patent 5,641,515 
and U. S. Patent 5,399,363. In certain embodiments, solutions of the active compounds 

25 as free base or pharmacologically acceptable salts may be prepared in water suitably 
mixed with a surfactant, such as hydroxypropylcellulose. Dispersions may also be 
prepared in glycerol, liquid polyethylene glycols, and mixtures thereof and in oils. 
Under ordinary conditions of storage and use, these preparations generally will contain a 
preservative to prevent the growth of microorganisms. 
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Illustrative pharmaceutical forms suitable for injectable use include 
sterile aqueous solutions or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersions (for example, see U. S. Patent 
5,466,468). In all cases the form must be sterile and must be fluid to the extent that 

5 easy syringability exists. It must be stable under the conditions of manufacture and 
storage and must be preserved against the contaminating action of microorganisms, 
such as bacteria and fungi. The carrier can be a solvent or dispersion medium 
containing, for example, water, ethanol, polyol (e.g., glycerol, propylene glycol, and 
liquid polyethylene glycol, and the like), suitable mixtures thereof, and/or vegetable 

10 oils. Proper fluidity may be maintained, for example, by the use of a coating, such as 
lecithin, by the maintenance of the required particle size in the case of dispersion and/or 
by the use of surfactants. The prevention of the action of microorganisms can be 
facilitated by various antibacterial and antifungal agents, for example, parabens, 
chlorobutanol, phenol, sorbic acid, thimerosal, and the like. In many cases, it will be 

15 preferable to include isotonic agents, for example, sugars or sodium chloride. 
Prolonged absorption of the injectable compositions can be brought about by the use in 
the compositions of agents delaying absorption, for example, aluminum monostearate 
and gelatin. 

In one embodiment, for parenteral administration in an aqueous solution, 
20 the solution should be suitably buffered if necessary and the liquid diluent first rendered 
isotonic with sufficient saline or glucose. These particular aqueous solutions are 
especially suitable for intravenous, intramuscular, subcutaneous and intraperitoneal 
administration. In this connection, a sterile aqueous medium that can be employed will 
be known to those of skill in the art in light of the present disclosure. For example, one 
25 dosage may be dissolved in 1 ml of isotonic NaCl solution and either added to 1000 ml 
of hypodermoclysis fluid or injected at the proposed site of infusion, (see for example, 
"Remington's Pharmaceutical Sciences" 15th Edition, pages 1035-1038 and 1570- 
1580). Some variation in dosage will necessarily occur depending on the condition of 
the subject being treated. Moreover, for human aciniinistration, preparations will of 
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course preferably meet sterility, pyrogenicity, and the general safety and purity 
standards as required by FDA Office of Biologies standards. 

In another embodiment of the invention, the compositions disclosed 
herein may be formulated in a neutral or salt form. Illustrative 
5 pharmaceutically-acceptable salts include the acid addition salts (formed with the free 
amino groups of the protein) and which are formed with inorganic acids such as, for 
example, hydrochloric or phosphoric acids, or such organic acids as acetic, oxalic, 
tartaric, mandelic, and the like. Salts formed with the free carboxyl groups can also be 
derived from inorganic bases such as, for example, sodium, potassium, ammonium, 

10 calcium, or ferric hydroxides, and such organic bases as isopropylamine, 
trimethylamine, histidine, procaine and the like. Upon formulation, solutions will be 
administered in a manner compatible with the dosage formulation and in such amount 
as is therapeutically effective. 

The carriers can further comprise any and all solvents, dispersion media, 

15 vehicles, coatings, diluents, antibacterial and antifungal agents, isotonic and absorption 
delaying agents, buffers, carrier solutions, suspensions, colloids, and the like. The use 
of such media and agents for pharmaceutical active substances is well known in the art. 
Except insofar as any conventional media or agent is incompatible with the active 
ingredient, its use in the therapeutic compositions is contemplated. Supplementary 

20 active ingredients . can also be incorporated into the compositions. The phrase 
"pharmaceutically-acceptable" refers to molecular entities and compositions, that do not 
produce an allergic or similar untoward reaction when administered to a human. 

In certain embodiments, the pharmaceutical compositions may be 
delivered by intranasal sprays, inhalation, and/or other aerosol delivery vehicles. 

25 Methods for delivering genes, nucleic acids, and peptide compositions directly to the 
lungs via nasal aerosol sprays has been described, e.g., in U. S. Patent 5,756,353 and U. 
S. Patent 5,804,212. Likewise, the delivery of drugs using intranasal microparticle 
resins (Takenaga et al, J Controlled Release 1998 Mar 2;52(l-2):81-7) and 
lysophosphatidyl-glycerol compounds (U. S. Patent 5,725,871) are also well-known in 
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the pharmaceutical arts. Likewise, illustrative transmucosal drug delivery in the form of 
a polytetrafluoroetheylene support matrix is described in U. S. Patent 5,780,045. 

In certain embodiments, liposomes, nanocapsules, microparticles, lipid 
particles, vesicles, and the like, are used for the introduction of the compositions of the 
5 present invention into suitable host cells/organisms. In particular, the compositions of 
the present invention may be formulated for delivery either encapsulated in a lipid 
particle, a liposome, a vesicle, a nanosphere, or a nanoparticle or the like. Alternatively, 
compositions of the present invention can be bound, either covalently or non-covalently, 
to the surface of such carrier vehicles. 
10 The formation and use of liposome and liposome-like preparations as 

potential drug carriers is generally known to those of skill in the art (see for example, 
.Lasic, Trends Biotechnol 1998 Jul;16(7):307-21; Takakura, Nippon Rinsho 1998 
Mar;56(3):691-5; Chandran et ah, Indian J Exp Biol. 1997 Aug;35(8):801-9; Margalit, 
Crit Rev Ther Drug Carrier Syst. 1995;12(2-3):233-61; U.S. Patent 5,567,434; U.S. 
15 Patent 5,552,157; U.S. Patent 5,565,213; U.S. Patent 5,738,868 and U.S. Patent 
5,795,587, each specifically incorporated herein by reference in its entirety). 

Liposomes have been used successfully with a number of cell types that 
are normally difficult to transfect by other procedures, including T cell suspensions, 
primary hepatocyte cultures and PC 12 cells (Renneisen et al, J Biol Chem. 1990 Sep 
20 25;265(27):16337-42; Muller et al, DNA Cell Biol. 1990 Apr;9(3):221-9). In addition, 
liposomes are free of the DNA length constraints that are typical of viral-based delivery 
systems. Liposomes have been used effectively to introduce genes, various drugs, 
radiotherapeutic agents, enzymes, viruses, transcription factors, allosteric effectors and 
the like, into a variety of cultured cell lines and animals. Furthermore, he use of 
25 liposomes does not appear to be associated with autoimmune responses or unacceptable 

toxicity after systemic delivery. 

In certain embodiments, liposomes are formed from phospholipids that 
are dispersed in an aqueous medium and spontaneously form multilamellar concentric 
bilayer vesicles (also termed multilamellar vesicles (MLVs). 
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/ Alternatively, in other embodiments, the invention provides for 
phaimaceutically-acceptable nanocapsule formulations of the compositions of the 
present invention. Nanocapsules can generally entrap compounds in a stable and 
reproducible way (see, for example, Quintanar-Guerrero et al, Drug Dev Ind Pharm. 
199» Dec;24(12):l 113-28). To avoid side effects due to intracellular polymeric 
overloading, such ultrafine particles (sized around 0.1 urn) may be designed using 
polymers able to be degraded in vivo. Such particles can be made as described, for 
example, by Couvreur etal. t Crit Rev Ther Drug Carrier Syst. 1988;5(l):l-20; zur 
Muhlen et al, Eur J Pharm Biopharm. 1998 Mar,45(2): 149-55; Zambaux et al J 
Controlled Release. 1998 Jan2;50(l-3):31-40; and U. S. Patent 5,145,684. 

Cancer Therapeutic Methods 

Immunologic approaches to cancer therapy are based on the recognition 
that cancer cells can often evade the body's defenses against aberrant or foreign cells 
and molecules, and that these defenses might be therapeutically stimulated to regain the 
15 lost ground, e.g. pgs. 623-648 in Klein, Immunology (Wiley-Interscience, New York, 
1982). Numerous recent observations that ^various immune effectors can directly or 
indirectly inhibit growth of tumors has led to renewed interest in this approach to cancer 
therapy, e.g. Jager, et al., Oncology 2001;60(l):l-7; Renner, et al., Ann Hematol 2000 
Dec;79(12):651-9. 

20 Four-basic cell types whose function has been associated with antitumor 

cell immunity and the elimination of tumor cells from the body are: i) B-lymphocytes 
which secrete immunoglobulins into the blood plasma for identifying and labeling the 
nonself invader cells; ii) monocytes which secrete the complement proteins that are 
responsible for lysing and processing the immunogjobulin-coated target invader cells; 

25 iii) natural killer lymphocytes having two mechanisms for the destruction of tumor 
cells, antibody-dependent cellular cytotoxicity and natural killing; and iv) T- 
lymphocytes possessing antigen-specific receptors and having ^e capacity to recognize 
a tumor cell carrying complementary marker molecules (Schreiber, H., 1989, in 
Fundamental Immunology (ed). W. E. Paul, pp. 923-955). 
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/ cancer immunotherapy generally focuses on inducing humoral immune 
responses, cellular immune responses, or both. Moreover, it is well established that 
induction of CD4 + T helper cells is necessary in order to secondarily induce either 
antibodies or cytotoxic CD8 + T cells. Polypeptide antigens that are selective or ideally 
5 specific for cancer cells, particularly lung cancer cells, offer a powerful approach for 
inducing immune responses against lung cancer, and are an important aspect of the 
present invention. 

Therefore, in further aspects of the present invention, the pharmaceutical 
compositions described herein may be used for the treatment of cancer, particularly for 

10 the immunotherapy of lung cancer. Within such methods, the pharmaceutical 
compositions described herein are administered to a patient, typically a warm-blooded 
animal, preferably a human. A patient may or may not be afflicted with cancer. 
Accoraingly, the above pharmaceutical compositions may be used to prevent the 
development of a cancer or to treat a patient afflicted with a cancer. Pharmaceutical 

15 compositions and vaccines may be administered either prior to or following surgical 
removal of primary tumors and/or treatment such as administration of radiotherapy or 
conventional chemotherapeutic drugs. As discussed above, administration of the 
pharmaceutical compositions may be by any suitable method, including administration 
by intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal, intradermal, 

20 anal, vaginal, topical and oral routes. 

Within certain embodiments, immunotherapy may be active 
immunotherapy, in which treatment relies on theTn vfvo stimulation of the endogenous 
host immune system to react against tumors with the administration of immune 
response-modifying agents (such as polypeptides and polynucleotides as provided 

25 herein). 

Within other embodiments, immunotherapy may be passive 
immunotherapy, in which treatment involves the delivery of agents with established 
tumor-immune reactivity (such as effector cells or antibodies) that can directly or 
indirectly mediate antitumor effects and does not necessarily depend on an intact host 
30 immune system. Examples of effector cells include T cells as discussed above, T 
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lymphocytes (such as CD8 + cytotoxic T lymphocytes and CD4 + T-helper tumor- 
infiltrating lymphocytes), killer cells (such as Natural Killer cells and lymphokine- 
activated killer cells), B cells and antigen-presenting cells (such as dendritic cells and 
macrophages) expressing a polypeptide provided herein. T cell receptors and antibody 
5 receptors specific for the polypeptides recited herein may be cloned, expressed and 
transferred into other vectors or effector cells for adoptive immunotherapy. The 
polypeptides provided herein may also be used to generate antibodies or anti-idiotypic 
antibodies (as described above and in U.S. Patent No. 4,918,164) for passive 
immunotherapy. 

1° Monoclonal antibodies may be labeled with any of a variety of labels for 

desired selective usages in detection, diagnostic assays or therapeutic applications (as 
described in U.S. Patent Nos. 6,090,365; 6,015,542; 5,843,398; 5,595,721; and 
4,708,930, hereby incorporated by reference in their entirety as if each was incorporated 
individually). In each case, the binding of the labelled monoclonal antibody to the 

15 determinant site of the antigen will signal detection or delivery of a particular 
therapeutic agent to the antigenic determinant on the non-normal cell. A further object 
of this invention is to provide the specific monoclonal antibody suitably labelled for 
achieving such desired selective usages thereof. 

Effector cells may generally be obtained in sufficient quantities for 

20 adoptive immunotherapy by growth in vitro, ak described herein. Culture conditions for 
expanding single antigen-specific effector cells to several billion in number with 
retention of antigen recognition in vivo are well known in the art Such in vitro culture 
conditions typically use intermittent stimulation with antigen, often in the presence of 
cytokines (such as IL-2) and non-dividing feeder cells. As noted above, 

25 immunoreactive polypeptides as provided herein may be used to rapidly expand 
antigen-specific T cell cultures in order to generate a sufficient number of cells for 
immunotherapy. In particular, antigen-presenting cells, such as dendritic, macrophage, 
monocyte, fibroblast and/or B cells, may be pulsed with imrqunoreactive polypeptides 
or transfected with one or more polynucleotides using standard techniques well known 

30 in the art For example, antigen-presenting cells can be transfected with a 
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polynucleotide having a promoter appropriate for increasing expression in a 
recombinant virus or other expression system. Cultured effector cells for use in therapy 
must be able to grow and distribute widely, and to survive long term in vivo. Studies 
have shown that cultured effector cells can be induced to grow in vivo and to survive 
5 long term in substantial numbers by repeated stimulation with antigen supplemented 
with IL-2 (see, for example, Cheever et al., Immunological Reviews 157:1V, 1997). 

Alternatively, a vector expressing a polypeptide recited herein may be 
introduced into antigen presenting cells taken from a patient and clonally propagated ex 
vivo for transplant back into the same patient Transfected ceUs may be reintroduced 
10 into the patient using any means known in the art, preferably in sterile form by 
intravenous, intracavitary, intraperitoneal or intratumor administration. 

Routes and frequency of administration of the therapeutic compositions 
described herein, as well as dosage, will vary from individual to individual, and may be 
readily established using standard techniques. In general, the pharmaceutical 
15 compositions and vaccines may be administered by injection (e.g., intracutaneous, 
intramuscular, intravenous or subcutaneous), intranasally (e.g., by aspiration) or orally. 
Preferably, between 1 and 10 doses may be administered over a 52 week period. 
Preferably, 6 doses are administered, at intervals of 1 month, and booster vaccinations 
may be given periodically thereafter. Alternate protocols may be appropriate for 
20 individual patients. A suitable dose is an amount of a compound that, when 
administered as described above, is capable of promoting an anti-tumor immune 
response, and is at least 10-50% above the basal (i.e., untreated) level. Such response 
can be monitored by measuring the anti-tumor antibodies in a patient or by vaccine- 
dependent generation of cytolytic effector cells capable of killing the patient's tumor 
25 cells in vitro. Such vaccines should also be capable of causing an immune response that 
leads to an improved clinical outcome (e.g., more frequent remissions, complete or 
partial or longer disease-free survival) in vaccinated patients as compared to non- 
vaccinated patients. In general, for pharmaceutical compositions and vaccines 
comprising one or more polypeptides, the amount of each polypeptide present in a dose 
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ranges from about 25 ng to 5 mg per kg of host. Suitable dose sizes will vary with the 
size of the patient, but will typically range from about 0.1 mL to about 5 mL. 

In general, an appropriate dosage and treatment regimen provides the 
active compound(s) in an amount sufficient to provide therapeutic and/or prophylactic 
5 benefit. Such a response can be monitored by establishing an improved clinical 
outcome (e.g., more frequent remissions, complete or partial, or longer disease-free 
survival) in treated patients as compared to non-treated patients. Increases in 
preexisting immune responses to a tumor protein generally correlate with an improved 
clinical outcome. Such immune responses may generally be evaluated using standard 
10 proliferation, cytotoxicity or cytokine assays, which may be performed using samples 
obtained from a patient before and after treatment 

Cancer Detection and Diagnostic Compositions, Methods and Kits 

In general, a cancer may be detected in a patient based on the presence of 
one or more lung tumor proteins and/or polynucleotides encoding such proteins in a 

15 biological sample (for example, blood, sera, sputum urine and/or tumor biopsies) 
obtained from the patient. In other words, such proteins may be used as markers to 
indicate the presence or absence of a cancer such as lung cancer. In addition, such 
proteins may be useful for the detection of other cancers. The binding agents provided 
herein generally permit detection of the level' of antigen that binds to the agent in the 

20 biological sample. 

Polynucleotide primers and probes may be used to detect the level of 
mRNA encoding a tumor protein, which is also indicative of the presence or absence of 
a cancer. In general, a tumor sequence should be present at a level that is at least two- 
fold, preferably three-fold, and more preferably five-fold or higher in tumor tissue than 
25 in normal tissue of the same type from which the tumor arose. Expression levels of a 
particular tumor sequence in tissue types different from that in which the tumor arose 
are irrelevant in certain diagnostic embodiments since the presence of tumor cells can 
be confirmed by observation of predetermined differential expression levels, e.g., 2- 
fold, 5-fold, etc, in tumor tissue to expression levels in normal tissue of the same type. 



102 



WO 02/00174 



PCT/US01/21065 



Other differential expression patterns can be utilized advantageously for 
diagnostic purposes. For example, in one aspect of the invention, overexpression of a 
tumor sequence in tumor tissue and normal tissue of the same type, but not in other 
normal tissue types, eg. PBMCs, can be exploited diagnostkally. In this case, the 

5 presence of metastatic tumor cells, for example in a sample taken from the circulation 
or some other tissue site different from that in which the tumor arose, can be identified 
and/or confirmed by detecting expression of the tumor sequence in the sample, for 
example using RT-PCR analysis. In many instances, it will be desired to enrich for 
tumor cells in the sample of interest, e.g., PBMCs, using cell capture or other like 

10 techniques. 

There are a variety of assay formats known to those of ordinary skill in 
the art for using a binding agent to detect polypeptide markers in a sample. See, e.g., 
Harlow and Lane, Antibodies: A Laboratory Manual, Cold Spring Harbor Laboratory, 
1988. In general, the presence or absence of a cancer in a patient may be determined by 
15 (a) contacting a biological sample obtained from a patient with a binding agent; (b) 
' detecting in the sample a level of polypeptide that binds to the binding agent; and (c) 
comparing the level of polypeptide with a predetermined cut-off value. 

In a preferred embodiment, the assay involves the use of binding agent 
immobilized on a solid support to bind to and remove the polypeptide from the 
20 remainder of the sample. The bound polypeptide may then be detected using a detection 
reagent that contains a reporter group and specifically binds to the binding 
agent/polypeptide complex. Such detection reagents may comprise, for example, a 
binding agent that specifically binds to the polypeptide or an antibody or other agent 
that specifically binds to the binding agent, such as an anti-immunoglobulin, protein G, 
25 protein A or a lectin. Alternatively, a competitive assay may be utilized, in which a 
polypeptide is labeled with a reporter group and allowed to bind to the irnmobilized 
binding agent after incubation of the binding agent with the sample. The extent to 
which components of the sample inhibit the binding of the labeled polypeptide to the 
binding agent is indicative of the reactivity of the sample with the immobilized binding 
30 agent. Suitable polypeptides for use within such assays include full length lung tumor 
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proteins £nd polypeptide portions thereof to which the binding agent binds, as described 
above. 

'i 

The solid support may be any material known to those of ordinary skill 
in the art to which the tumor protein may be attached. For example, the solid support 
5 may be a test well in a microtiter plate or a nitrocellulose or other suitable membrane. 
Alternatively, the support may be a bead or disc, such as glass, fiberglass, latex or a 
plastic material such as polystyrene or polyvinylchloride. The support may also be a 
magnetic particle or a fiber optic sensor, such as those disclosed, for example, in U.S. 
Patent No. 5,359,681. The binding agent may be immobilized on the solid support 
10 using a variety of techniques known to those of skill in the art, which are amply 
described in the patent and scientific literature. In the context of the present invention, 
the term "immobilization" refers to both noncovalent association, such as adsorption, 
and covalent attachment (which may be a direct linkage between the agent and 
functional groups on the support or may be a linkage by way of a cross-linking agent). 
15 Immobilization by adsorption to a well in a microtiter plate or to a membrane is 
preferred. In such cases, adsorption may be achieved by contacting the binding agent, in 
a suitable buffer, with the solid support for a suitable amount of time. The contact time 
varies with temperature, but is typically between about 1 hour and about 1 day. In 
general, contacting a well of a plastic microtiter plate (such as polystyrene or 
20 polyvinylchloride) with an amount of binding agent ranging from about 10 ng to about 
10 ng, and preferably about 100 ng to about 1 \xg, is sufficient to immobilize an 
adequate amount of binding agent. 

Covalent attachment of binding agent to a solid support may generally be 
achieved by first reacting the support with a Afunctional reagent that will react with 
25 both the support and a functional group, such as a hydroxyl or amino group, on the 
binding agent. For example, the binding agent may be covalently attached to supports 
having an appropriate polymer coating using benzoquinone or by condensation of an 
aldehyde group on the support with an amine and an active hydrogen on the binding 
partner (see, e.g., Pierce Immunotechnology Catalog and Handbook, 1991, at 
30 A12-A13). 
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In certain embodiments, the assay is a two-antibody sandwich assay. 
This assay may be performed by first contacting an antibody that has been immobilized 
on a solid support, commonly the well of a microliter plate, with the sample, such that 
polypeptides within the sample are allowed to bind to the immobilized antibody. 

5 Unbound sample is then removed from the immobilized polypeptide-antibody 
complexes and a detection reagent (preferably a second antibody capable of binding to a 
different site on the polypeptide) containing a reporter group is added. The amount of 
detection reagent that remains bound to the solid support is then determined using a 
method appropriate for the specific reporter group. 

10 More specifically, once the antibody is immobilized on the support as 

described above, the remaining protein binding sites on the support are typically 
blocked. Any suitable blocking agent known to those of ordinary skill in the art, such as 
bovine serum albumin or Tween 20™ (Sigma Chemical Co., St. Louis, MO). The 
immobilized antibody is then incubated with the sample, and polypeptide, is allowed to 

15 bind to the antibody. The sample may be diluted with a suitable diluent, such as 
phosphate-buffered saline (PBS) prior to incubation. In general, an appropriate contact 
time (/.&, incubation time) is a period of time that is sufficient to detect the presence of 
polypeptide within a sample obtained from an individual with lung cancer. Preferably, 
the contact time is sufficient to achieve a level of binding that is at least about 95% of 

20 that achieved at equilibrium between bound and unbound polypeptide. Those of 
ordinary skill in the art will recognize that the time necessary to achieve equilibrium 
may be readily determined by assaying the level of binding that occurs over a period of 
time. At room temperature, an incubation time of about 30 minutes is generally 
sufficient 

25 Unbound sample may then be removed by washing the solid support 

with an appropriate buffer, such as PBS containing 0.1% Tween 20™. The second 
antibody, which contains a reporter group, may then be added to the solid support. 
Preferred reporter groups include those groups recited above. , 

The detection reagent is then incubated with the immobilized antibody- 

30 polypeptide complex for an amount of time sufficient to detect the bound polypeptide. 
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An appropriate amount of time may generally be determined by assaying the level of 
binding that occurs over a period of time. Unbound detection reagent is then removed 
and bound detection reagent is detected using the reporter group. The method employed 
for detecting the reporter group depends upon the nature of the reporter group. For 
5 radioactive groups, scintillation counting or autc Jjographic methods are generally 
appropriate. Spectroscopic methods may be used to detect dyes, luminescent groups 
and fluorescent groups. Biotin may be detected using avidin, coupled to a different 
reporter group (commonly a radioactive or fluorescent group or an enzyme). Enzyme 
reporter groups may generally be detected by the addition of substrate (generally for a 
• 10 specific period of time), followed by spectroscopic or other analysis of the reaction 
products. 

To determine the presence or absence of a cancer, such as lung cancer, 
the signal detected from the reporter group that remains bound to the solid support is 
generally compared to a signal that corresponds to a predetermined cut-off value. In 

15 one preferred embodiment, the cut-off value for the detection of a cancer is the average 
mean signal obtained when the immobilized antibody is incubated with samples from 
patients without the cancer. In general, a sample generating a signal that is three 
standard deviations above the predetermined cut-off value is considered positive for the 
cancer. In an alternate preferred embodiment, the cut-off value is determined using a 

20 Receiver Operator Curve, according to the method of Sackett et al., Clinical 
Epidemiology: A Basic Science for Clinical Medicine, Little Brown and Co., 1985, 
p. 106-7. Briefly, in this embodiment, the cut-off value may be determined from a plot 
of pairs of true positive rates (ie. 9 sensitivity) and false positive rates (100%-specificity) 
that correspond to each possible cut-off value for the diagnostic test result. The cut-off 

25 value on the plot that is the closest to the upper left-hand corner (i.e. 9 the value that 
encloses the largest area) is the most accurate cut-off value, and a sample generating a 
signal that is higher than the cut-off value determined by this method may be considered 
positive. Alternatively, the cut-off value may be shifted to ^he left along the plot, to 
minimize the false positive rate, or to the right, to minimize the false negative rate. In 
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general, a-sample generating a signal that is higher than the cut-off value determined by 
this method is considered positive for a cancer. 

In a related embodiment, the assay is performed in a flow-through or 
strip test format, wherein the binding agent is immobilized on a membrane, such as 
5 nitroceUulose. In the flow-through test, polypeptides within the sample bind to the 
immobilized binding agent as the sample passes through the membrane. A second, 
labeled binding agent then binds to the binding agent-polypeptide complex as a solution 
containing the second binding agent flows through the membrane. The detection of 
bound second binding agent may then be performed as described above. In the strip test 
10 format, one end of the membrane to which binding agent is bound is immersed in a 
solution containing the sample. The sample migrates along the membrane through a 
region containing second binding agent and to the area of immobilized binding agent. 
Concentration of second binding agent at the area of immobilized antibody indicates the 
presence of a cancer. Typically, the concentration of second binding agent at that site 
15 generates a pattern, such as a line, that can be read visually. The absence of such a 
pattern indicates a negative result. In general, the amount of binding agent immobilized 
on the membrane is selected to generate a visually discernible pattern when the 
biological sample contains a level of polypeptide that would be sufficient to generate a 
positive signal in the two-antibody sandwich assay, in the format discussed above. 
20 Preferred binding agents for use in such assays are antibodies and antigen-binding 
fragments thereof. Preferably, the amount of antibody immobilized on the membrane 
ranges from about 25 ng to about lug, and more preferably from about 50 ng to about 
500 ng. Such tests can typically be performed with a very small amount of biological 
sample. 

25 Of course, numerous other assay protocols exist that are suitable for use 

with the tumor proteins or binding agents of the present invention. The above 
descriptions are intended to be exemplary only. For example, it will be apparent to 
those of ordinary skill in the art that the above protocols may he readily modified to use 
tumor polypeptides to detect antibodies that bind to such polypeptides in a biological 
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sample. The detection of such tumor protein specific antibodies may correlate with the 
presence of a cancer. 

A cancer may also, or alternatively, be detected based on the presence of 
T cells that specifically react with a tumor protein in a biological sample. Within 
5 certain methods, a biological sample comprising CD4 + and/or CD8 + T cells isolated 
from a patient is incubated with a tumor polypeptide, a polynucleotide encoding such a 
polypeptide and/or an APC that expresses at least an immunogenic portion of such a 
polypeptide, and the presence or absence of specific activation of the T cells is detected. 
Suitable biological samples include, but are not limited to, isolated T cells. For 

10 example, T cells may be isolated from a patient by routine techniques (such as by 
Ficoll/Hypaque density gradient centrifugation of peripheral blood lymphocytes). T 
cells may be incubated in vitro for 2-9 days (typically 4 days) at 37°C with polypeptide 
(e.g., 5 - 25 jig/ml). It may be desirable to incubate another aliquot of a T cell sample in 
the absence of tumor polypeptide to serve as a control. For CD4 + T cells, activation is 

15 preferably detected by evaluating proliferation of the T cells. For CD8 + T cells, 
activation is preferably detected by evaluating cytolytic activity. A level of proliferation 
that is at least two fold greater and/or a level of cytolytic activity that is at least 20% 
greater than in disease-free patients indicates the presence of a cancer in the patient. 

As noted above, a cancer may also, or alternatively, be detected based on 

20 the level of mRNA encoding a tumor proteiri in a biological sample. For example, at 
least two oligonucleotide primers may be employed in a polymerase chain reaction 
(PCR) based assay to amplify a portion of a tumor cDNA derived from a biological 
sample, wherein at least one of the oligonucleotide primers is specific for (i.e., 
hybridizes to) a polynucleotide encoding the tumor protein. The amplified cDNA is 

25 then separated and detected using techniques well known in the art, such as gel 
electrophoresis. 

Similarly, oligonucleotide probes that specifically hybridize to a 
polynucleotide encoding a tumor protein may be used in a hybridization assay to detect 
the presence of polynucleotide encoding the tumor protein in a biological sample. 
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To permit hybridization under assay conditions, oligonucleotide primers 
and probes should comprise an oligonucleotide sequence that has at least about 60%, 
preferably at least about 75% and more preferably at least about 90%, identity to a 
portion of a polynucleotide encoding a tumor protein of the invention that is at least 10 

5 nucleotides, and preferably at least 20 nucleotides, in length. Preferably, 
oligonucleotide primers and/or probes hybridize to a polynucleotide encoding a 
polypeptide described herein under moderately stringent conditions, as defined above. 
Oligonucleotide primers and/or probes which may be usefully employed in the 
diagnostic methods described herein preferably are at least 10-40 nucleotides in length. 

10 In a preferred embodiment, the oligonucleotide primers comprise at least 10 contiguous 
nucleotides, more preferably at least 15 contiguous nucleotides, of a DNA molecule 
having a sequence as disclosed herein. Techniques for both PCR based assays and 
hybridization assays are well known in the art {see, for example, Mullis et al., Cold 
Spring Harbor Symp. Quant. Biol, 57:263, 1987; Erlich ed., PCR Technology, Stockton 

15 Press, NY, 1989). 

One preferred assay employs RT-PCR, in which PCR is applied in 
conjunction with reverse transcription. Typically, RNA is extracted from a biological 
sample, such as biopsy tissue, and is reverse transcribed to produce cDNA molecules. 
PCR amplification using at least one specific primer generates a cDNA molecule, which 

20 may be separated and visualized using, for example, gel electrophoresis. Amplification 
may be performed on biological samples taken from a test patient and from an 
individual who is not afflicted with a cancer. The amplification reaction may be 
performed on several dilutions of cDNA spanning two orders of magnitude. A two-fold 
or greater increase in expression in several dilutions of the test patient sample as 

25 compared to the same dilutions of the non-cancerous sample is typically considered 
positive. 

In another aspect of the present invention, cell capture technologies may 
be used in conjunction, with, for example, real-time PCR to ( provide a more sensitive 
tool for detection of metastatic cells expressing lung tumor antigens. Detection of lung 
30 cancer cells in biological samples, e.g., bone marrow samples, peripheral blood, and 
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small needle aspiration samples is desirable for diagnosis and prognosis in lung cancer 
patients. 

Immunomagnetic beads coated with specific monoclonal antibodies to 
surface cell markers, or tetrameric antibody complexes, may be used to first enrich or 
5 positively select cancer cells in a sample. Various commercially available kits may be 
used, including Dynabeads® Epithelial Enrich (Dynal Biotech, Oslo, Norway), 
StemSep™ (StemCell Technologies, Inc., Vancouver, BC), and RosetteSep (StemCell 
Technologies). A skilled artisan will recognize that other methodologies and kits may 
also be used to enrich or positively select desired cell populations. Dynabeads® 

10 Epithelial Enrich contains magnetic beads coated with mAbs specific for two 
glycoprotein membrane antigens expressed on normal and neoplastic epithelial tissues. 
The coated beads may be added to a sample and the sample then applied to a magnet, 
thereby capturing the cells bound to the beads. The unwanted cells are washed away 
and the magnetically isolated cells eluted from the beads and used in further analyses. 

15 RosetteSep can be used to enrich cells directly from a blood sample and 

consists of a cocktail of tetrameric antibodies that targets a variety of unwanted cells 
and crosslinks them to glycophorin A on red blood cells (RBC) present in the sample, 
forming rosettes. When centrifuged over Ficoll, targeted cells pellet along with the free 
RBC. The combination of antibodies in the depletion cocktail determines which cells 

20 will be removed and consequently which cells will be recovered. Antibodies that are 
available include, but are not limited to: CD2, CD3, CD4, CD5, CD8, CD10, CDllb, 
CD14, CD15, CD16, CD19, CD20, CD24, CD25, CD29, CD33, CD34, CD36, CD38, 
CD41, CD45, CD45RA, CD45RO, CD56, CD66B, CD66e, HLA-DR, IgE, and TCRcxp. 

Additionally, it is contemplated in the present invention that mAbs 

25 specific for lung tumor antigens can be generated and used in a similar manner. For 
example, mAbs that bind to tumor-specific cell surface antigens may be conjugated to 
magnetic beads, or formulated in a tetrameric antibody complex, and used to enrich or 
positively select metastatic lung tumor cells from a sample. Once a sample is enriched 
or positively selected, cells may be lysed and RNA isolated. RNA may then be 

30 subjected to RT-PCR analysis using lung tumor-specific primers in a real-time PCR 
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assay as described herein. One skilled in the art will recognize that enriched or selected 
populations of cells may be analyzed by other methods (e.g. in situ hybridization or 
flow cytometry). 

In another embodiment, the compositions described herein may be used 
5 as markers for the progression of cancer. In this embodiment, assays as described above 
for the diagnosis of a cancer may be performed over time, and the change in the level of 
reactive polypeptide(s) or polynucleotide(s) evaluated For example, the assays may be 
performed every 24-72 hours for a period of 6 months to 1 year, and thereafter 
performed as needed. In general, a cancer is progressing in those patients in whom the 

10 level of polypeptide or polynucleotide detected increases over time. In contrast, the 
cancer is not progressing when the level of reactive polypeptide or polynucleotide either 
remains constant or decreases with time. 

Certain in vivo diagnostic assays may be performed directly on a tumor. 
One such assay involves contacting tumor cells with a binding agent. The bound 

15 binding agent may then be detected directly or indirectly via a reporter group. Such 
binding agents may also be used in histological applications. Alternatively, 
polynucleotide probes may be used within such applications. 

As noted above, to improve sensitivity, multiple tumor protein markers 
may be assayed within a given sample. It will be apparent that binding agents specific 

20 for different proteins provided herein may be 1 combined within a single assay. Further, 
multiple primers or probes may be used concurrently. The selection of tumor protein 
markers may be based on routine experiments to determine combinations that results in 
optimal sensitivity. In addition, or alternatively, assays for tumor proteins provided 
herein may be combined with assays for other known tumor antigens. 

25 The present invention further provides kits for use within any of the 

above diagnostic methods. Such kits typically comprise two or more components 
necessary for performing a diagnostic assay. Components may be compounds, reagents, 
containers and/or equipment. For example, one container within a kit may contain a 
monoclonal antibody or fragment thereof that specifically binds to a tumor protein. 

30 Such antibodies or fragments may be provided attached to a support material, as 
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described above. One or more additional containers may enclose elements, such as 
reagents or buffers, to be used in the assay. Such kits may also, or alternatively, contain 
a detection reagent as described above that contains a reporter group suitable for direct 
or indirect detection of antibody binding. 
5 Alternatively, a kit may be designed to detect the level of mRNA 

encoding a tumor protein in a biological sample. Such kits generally comprise at least 
one oligonucleotide probe or primer, as described above, that hybridizes to a 
polynucleotide encoding a tumor protein. Such an oligonucleotide may be used, for 
example, within a PCR or hybridization assay. Additional components that may be 
1 0 present within such kits include a second oligonucleotide and/or a diagnostic reagent or 
container to facilitate the detection of a polynucleotide encoding a tumor protein. 

The following Examples are offered by way of illustration and not by 
wa y of limitation. 
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EXAMPLES 
EXAMPLE 1 

ISOLATION AN D CHARACTERIZATION OF cDNA SEQUE NCES ENCODING 
5 LUNG TUMOR POLYPEPTIDES 

This example illustrates the isolation of cDNA molecules encoding lung 
tumor-specific polypeptides from lung tumor cDNA libraries. 



A. ISOLATION OF cDNA SEQUENCES FROM A LUNG SQUAMOUS CELL 
CARCINOMA LIBRARY 

10 A human lung squamous cell carcinoma cDNA expression library was 

constructed from poly A + RNA from a pool of two patient tissues using a Superscript 
Plasmid System for cDNA Synthesis and Plasmid Cloning kit (BRL Life Technologies, 
Gaithersburg, MD) following the manufacturer's protocol. Specifically, lung carcinoma 
tissues were homogenized with polytron (Kinematica, Switzerland) and total RNA was 

1 5 extracted using Trizol reagent (BRL Life Technologies) as directed by the manufacturer. 
The poly A + RNA was then purified using an oligo dT cellulose column as described in 
Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratories, Cold Spring Harbor, NY, 1989. First-strand cDNAwas synthesized using 
the NotI/01igo-dT18 primer. Double-stranded cDNA was synthesized, ligated with 

20 BstXI/EcoRI adaptors (Invitrogen, San Diego, CA) and digested with NotL Following 
size fractionation with cDNA size fractionation columns (BRL Life Technologies), the 
cDNA was ligated into the BstXI/NotI site of pcDNA3.1 (Invitrogen) and transformed 
into ElectioMax E. coli DH10B cells (BRL Life Technologies) by electroporation. 

Using the same procedure, a normal human lung cDNA expression 

25 library was prepared from a pool of four tissue specimens. The cDNA libraries were 
characterized by deternuning the number of independent colonies, the percentage of 
clones that carried insert, the average insert size and by sequence analysis. The lung 
squamous cell carcinoma library contained 2.7 x 10 6 independent colonies, with 100% 
of clones having an insert and the average insert size being 2100 base pairs. The normal 
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lung cDNA library contained 1.4 x 10 6 independent colonies, with 90% of clones 
having inserts and the average insert size being 1800 base pairs. For both libraries, 
sequence analysis showed that the majority of clones had a full length cDNA sequence 
and were synthesized from mRNA 
5 cDNA library subtraction was performed using the above lung squamous 

cell carcinoma and normal lung cDNA libraries, as described by Hara et al {Blood, 
£4:189-199, 1994) with some modifications. Specifically, a lung squamous cell 
carcinoma-specific subtracted cDNA library was generated as follows. Normal tissue 
cDNA library (80 *ig) was digested with BamHI and Xhol, followed by a filling-in 
10 reaction with DNA polymerase Klenow fragment. After phenol-chloroform extraction 
and ethanol precipitation, the DNA was dissolved in 133 \x\ of H 2 0, heat-denatured and 
mixed with 133 \A (133 ng) of Photoprobe biotin (Vector Laboratories, Burlingame, 
CA). As recommended by the manufacturer, the resulting mixture was irradiated with a 
270 W sunlamp on ice for 20 minutes. Additional Photoprobe biotin (67 yS) was added 

15 and the biotinylation reaction was repeated. After extraction with butanol five times, 
the DNA was ethanol-precipitated and dissolved in 23 jil H 2 0 to form the driver DNA. 

To form the tracer DNA, 10 jig lung squamous cell carcinoma cDNA 
library was digested with NotI and Spel, phenol chloroform extracted and passed 
through Chroma spin-400 columns (Clontech, Palo Alto, CA). typically, 5 \ig of 

20 cDNA was recovered after the sizing coluriua. Following ethanol precipitation, the 
tracer DNA was dissolved in 5 ^1 H 2 0. Tracer DNA was mixed with 15 \il driver DNA 
and 20 yl of 2 x hybridization buffer (1.5 M NaCl/10 mM EDTA/50 mM HEPES pH 
7.5/0.2% sodium dodecyl sulfate), overlaid with mineral oil, and heat-denatured 
completely. ;The sample was immediately transferred into a 68 °C water bath and 

25 incubated for 20 hours (long hybridization [EH]). The reaction mixture was then 
subjected to a streptavidin treatment followed by phenol/chloroform extraction. This 
process was repeated three more times. Subtracted DNA was precipitated, dissolved in 
12 \xl H 2 0, mixed with 8 jd driver DNA and 20 \xl of 2 x hybridization buffer, and 
subjected to a hybridization at 68 °C for 2 hours (short hybridization [SH]). After 

30 removal of biotinylated double-stranded DNA, subtracted cDNA was ligated into 
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NouVSpel site of chloramphenicol resistant pBCSK + (Stratagene, La Jolla, CA) and 
transformed into ElectroMax E. coli DH10B cells by electroporation to generate a lung 
squamous cell carcinoma specific subtracted cDNA library (herein after referred to as 
"lung subtraction I"). 

5 A second lung squamous cell carcinoma specific subtracted cDNA 

library (referred to as "lung subtraction IT) was generated in a similar way to the lung 
subtraction library I, except that eight frequently recovered genes from lung subtraction 
I were included in the driver DNA, and 24,000 independent clones were recovered. 

To analyze the subtracted cDNA libraries, plasmid DNA was prepared 
10 from 320 independent clones, randomly picked from the subtracted lung squamous cell 
carcinoma specific libraries. Representative cDNA clones were further characterized by 
DNA sequencing with a Peikin Elmer/Applied Biosystems Division Automated 
Sequencer Model 373A and/or Model 377 (Foster City, CA). The cDNA sequences for 
sixty isolated clones are provided in SEQ ID NO: 1-60. These sequences were 
15 compared to known sequences in the gene bank using the EMBL and GenBank 
databases (release 96). No significant homologies were found to the sequences 
provided in SEQ ID NO: 2, 3, 19, 38 and 46. The sequences of SEQ ID NO: 1, 6-8, 10- 
13, 15, 17, 18, 20-27, 29, 30, 32, 34-37, 39-45, 47-49, 51, 52, 54, 55 and 57-59 were 
found to show some homology to previously identified expressed sequence tags (ESTs). 
20 The sequences of SEQ ID NO: 9, 28, 31 and 33 were found to show some homology to 
previously identified non-human gene sequences and the sequences of SEQ ID NO: 4, 
5, 14, 50, 53, 56 and 60 were found to show some homology to gene sequences 

previously identified in humans. 

The subtraction procedure described above was repeated using the above 

25 lung squamous cell carcinoma cDNA library as the tracer DNA, and the above normal 
lung tissue cDNA library and a cDNA library from nonnal liver and heart (constructed 
from a pool of one sample of each tissue as described above), plus twenty other cDNA 
clones that were frequently recovered in lung subtractions I and II, as the driver DNA 
(lung subtraction III). The normal liver and heart cDNA library contained 1.76 x 10 6 

30 independent colonies, with 100% of clones having inserts and the average insert size 
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being 1600 base pairs. Ten additional clones were isolated (SEQ ID NO: 61-70). 
Comparison of these cDN A sequences with those in the gene bank as described above, 
revealed no significant homologies to the sequences provided in SEQ ED NO: 62 and 
67. The sequences of SEQ ID NO: 61, 63-66, 68 and 69 were found to show some 
5 homology to previously isolated ESTs and the sequence provided in SEQ ID NO: 70 
was found to show some homology to a previously identified rat gene. 

In further studies, the subtraction procedure described above was 
repeated using the above lung squamous cell carcinoma cDNA library as the tracer 
DNA, and a cDNA library from a pool of normal lung, kidney, colon, pancreas, brain, 

10 resting PBMC, heart, skin and esophagus as the driver DNA, with esophagus cDNAs 
making up one third of the driver material Since esophagus is enriched in normal 
epithelial cells, including differentiated squamous cells, this procedure is likely to 
enrich genes that are tumor specific rather than tissues specific. The cDNA sequences 
of 48 clones determined in this subtraction are provided in SEQ ID NO: 177-224. The 

15 sequences of SEQ ID NO: 177, 178, 180, 181, 183, 187, 192, 195-197, 208, 211, 212, 
215, 216, 218 and 219 showed some homology to previously identified genes. The 
sequences of SEQ ID NO: 179, 182, 184-186, 188-191, 193, 194, 198-207, 209 210, 
213, 214, 217, 220 and 224 showed some homology to previously determined ESTs. 
The sequence of SEQ ID NO: 221-223 showed no homology to any previously 

20 determined sequence. ' 

B. ISOLATION OF cDNA SEQUENCES FROM A LUNG 
ADENOCARCINOMA LIBRARY 

A human lung adenocarcinoma cDNA expression library was 
constructed as described above. The library contained 3.2 x 10 6 independent colonies, 
25 with 100% of clones having an insert and the average insert size being 1500 base pairs. 
Library subtraction was performed as described above using the normal lung and 
normal liver and heart cDNA expression libraries described ^bove as the driver DNA. 
Twenty-six hundred independent clones were recovered. 
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Initial cDNA sequence analysis from 100 independent clones revealed 
many ribosomal protein genes. The cDNA sequences for fifteen clones isolated in this 
subtraction are provided in SEQ ID NO: 71-86. Comparison of these sequences with 
those in the gene bank as described above revealed no significant homologies to the 
5 sequence provided in SEQ ID NO: 84. The sequences of SEQ ID NO: 71, 73, 74, 77, 
78 and 80-82 were found to show some homology to previously isolated ESTs, and the 
sequences of SEQ ID NO: 72, 75, 76, 79, 83 and 85 were found to show some 
homology to previously identified human genes. 

In further studies, a cDNA library (referred to as mets3616A) was 
10 constructed from a metastatic lung adenocarcinoma. The determined cDNA sequences 
of 25 clones sequenced at random from this library are provided in SEQ ID NO: 255- 
279. The mets3616A cDNA library was subtracted against a cDNA library prepared 
from a pool of normal lung, liver, pancreas, skin, kidney, brain and resting PBMC. To 
increase the specificity of the subtraction, the driver was spiked with genes that were 
15 determined to be most abundant in the mets3616A cDNA library, such as EFl-alpha, 
■ integrin-beta and anticoagulant protein PP4, as well as with cDNAs that were 
previously found to-be differentially expressed in subtracted lung adenocarcinoma 
cDNA libraries. The <letermined cDNA sequences of 51 clones isolated from the 
subtracted library (referred to as mets3616A-Sl) are provided in SEQ ID NO: 280-330. 
20 Comparison of the sequences of SEQ ID NO: 255-330 with those in the 

public databases revealed no significant homologies to the sequences of SEQ ID NO: 
255-258, 260, 262-264, 270, 272, 275, 276, 279, 281, 287, 291, 296, 300 and 310. The 
sequences of SEQ ED NO: 259, 261, 265-269, 271, 273, 274, 277, 278, 282-285, 288- 
290, 292, 294, 297-299, 301, 303-309, 313, 314, 316, 320-324 and 326-330 snowed 
25 some homology to previously identified gene sequences, while the sequences of SEQ ID 
NO: 280, 286, 293, 302, 310, 312, 315, 317-319 and 325 showed some homology to 
previously isolated expressed sequence tags (ESTs). 
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EXAMPLE 2 

DETERMINATION OF TISSUE SPECIFICITY OF LUNG TUMOR POLYPEPTIDES 
Using gene specific primers, mRNA expression levels for seven 
representative lung tumor polypeptides described in Example 1 were examined in a 
5 variety of normal and tumor tissues using RT-PCR. 

Briefly, total RNA was extracted from a variety of normal and tumor 
tissues using Trizol reagent as described above. First strand synthesis was carried out 
using 2 fig of total RNA with Superscript II reverse transcriptase (BRL Life 
Technologies) at 42 °C for one hour. The cDNA was then amplified by PCR with gene- 

10 specific primers. To ensure the semi-quantitative nature of the RT-PCR, p-actin was 
used as an internal control for each of the tissues examined. 1 ^1 of 1:30 dilution of 
cDNA was employed to enable the linear range amplification of the p-actin template 
and was sensitive enough to reflect the differences in the initial copy numbers: Using 
these conditions, the p-actin levels were determined for each reverse transcription 

15 reaction from each tissue. DNA contamination was minimized by DNase treatment and 
by assuring a negative PCR result when using first strand cDNA that was prepared 
without adding reverse transcriptase. 

mRNA Expression levels were examined in five different types of tumor 
tissue (lung squamous cell carcinoma from 3 patients, lung adenocarcinoma, colon 

20 tumor from 2 patients, breast tumor and prostate tumor), and thirteen different normal 
tissues (lung from 4 donors, prostate, brain, kidney, liver, ovary, skeletal muscle, skin, 
small intestine, stomach, myocardium, retina and testes). Using a 10-fold amount of 
cDNA, the antigen LST-S1-90 (SEQ ID NO: 3) was found to be expressed at high 
levels in lung squamous cell carcinoma and in breast tumor, and at low to undetectable 

25 levels in the other tissues examined. 

The antigen LST-S2-68 (SEQ ID NO: 15) appears to be specific to lung 
and breast tumor, however, expression was also detected in normal kidney. Antigens 
LST-S1-169 (SEQ ID NO: 6) and LST-S1-133 (SEQ ID NO: 5) appear to be very 
abundant in lung tissues (both normal and tumor), with the expression of these two 

30 genes being decreased in most of the normal tissues tested. Both LST-S 1-169 and LST- 
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Sl-133 v^ere also expressed in breast and colon tumors. Antigens LST-S1-6 (SEQ ID 
NO: 7) and LST-S2-I2-5F (SEQ ID NO: 47) did not show tumor or tissue specific 
expression, with the expression of LST-S1-28 being rare and only detectable in a few 
tissues. The antigen LST-S3-7 (SEQ ID NO: 63) showed lung and breast tumor specific 
5 expression, with its message only being detected in normal testes when the PCR was 
performed for 30 cycles. Lower level expression was detected in some normal tissues 
when the cycle number was increased to 35. Antigen LST-S3-13 (SEQ ID NO: 66) was 
found to be expressed in 3 out of 4 lung tumors, one breast tumor and both colon tumor 
samples. Its expression in normal tissues was lower compared to tumors, and was only 
10 detected in 1 out of 4 normal lung tissues and in normal tissues from kidney, ovary and 
retina. Expression of antigens LST-S3-4 (SEQ ID NO: 62) and LST-S3-14 (SEQ ID 
NO: 67) was rare and did not show any tissue or tumor specificity. Consistent with 
Northern blot analyses, the RT-PCR results on antigen LAT-S1-A-10A (SEQ ID NO: 
78) suggested that its expression is high in lung, colon, stomach and small intestine 
15 tissues, including lung and colon tumors, whereas its expression was low or 
undetectable in other tissues. 

A total of 2002 cDNA fragments isolated in lung subtractions I, II and 
IE, described above, were colony PCR amplified and their mRNA expression levels in 
lung tumor, normal lung, and various other normal and tumor tissues were determined 
20 using microarray technology (Synteni, Palo Alto, CA). Briefly, the PCR amplification 
products were dotted onto slides in an array format, with each product occ\q>ying a 
unique location in the array. mRNA was extracted from the tissue sample to be tested, 
reverse transcribed, and fluorescent-labeled cDNA probes were generated. The 
microarrays were probed with the labeled cDNA probes, the slides scanned and 
25 fluorescence intensity was measured. This intensity correlates with the hybridization 
intensity. Seventeen non-redundant cDNA clones showed over-expression in lung 
squamous tumors, with expression in normal tissues tested (lung, skin, lymph node, 
colon, liver, pancreas, breast, heart, bone marrow, large intestine, kidney, stomach, 
brain, small intestine, bladder and salivary gland) being either undetectable, or 10-fold 
30 less compared to lung squamous tumors. The determined cDNA sequences for the 
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clone I413S are provided in SEQ ID NO: 87 and 88; those for L514S are provided in 
SEQ ID NO: 89 and 90; those for L516S in SEQ ID NO: 91 and 92; that for L517S in 
SEQ ID NO: 93; that for L5 1 9S in SEQ ID NO: 94; those for L520S in SEQ ID NO: 95 
and 96; those for L521S in SEQ ID NO: 97 and 98; that for L522S in SEQ ID NO: 99; 
5 that for L523S in SEQ ID NO: 100; that for L524S in SEQ ID NO: 101; that for L525S 
in SEQ ID NO: 102; that for L526S in SEQ ID NO: 103; that for L527S in SEQ ID NO: 
104; that for L528S in SEQ ID NO: 105; that for L529S in SEQ ID NO: 106; and those 
for L530S in SEQ ID NO: 107 and 108. Additionally, the full-length cDNA sequence 
for L530S is provided in SEQ ID NO: 151, with the corresponding amino acid sequence 
10 being provided in SEQ ID NO: 152. L530S shows homology to a splice variant of a 
p53 tumor suppressor homologue, p63. The cDNA sequences of 7 known isoforms of 
p63 are provided in SEQ ID NO: 331-337, with the corresponding amino acid 
sequences being provided in SEQ ID NO: 338-344, respectively. 

Due to polymorphisms, the clone L53 1 S appears to have two forms. A 

15 first determined full-length cDNA sequence for L531S is provided in SEQ ID NO: 109, 
with the corresponding amino acid sequence being provided in SEQ ID NO: 110. A 
second determined full-length cDNA sequence for L531S is provided in SEQ ID NO: 
111, with the corresponding amino acid sequence being provided in SEQ ID NO: 1 12. 
The sequence of SEQ ID NO: 1 1 1 is identical to that of SEQ ID NO: 109, except that it 

20 contains a 27 bp insertion. Similarly, L514S has two alternatively spliced forms; the 
first variant cDNA is listed as SEQ ID NO: 153, with the corresponding amino acid 
sequence being provided in SEQ ED NO: 155. The full-length cDNA for the second 
variant form of L514S is provided in SEQ ID NO: 154, with the corresponding amino 
acid sequence being provided in SEQ ID NO: 1 56. 

25 Full length cloning for L524S (SEQ ID NO: 101) yielded two variants 

(SEQ ID NO: 163 and 164) with the corresponding amino acid sequences of SEQ ID 
NO: 165 and 166, respectively. Both variants have been shown to encode parathyroid 
hormone-related peptide. 

Attempts to isolate the full-length cDNA for L519S, resulted in the 

3 0 isolation of the extended cDNA sequence provided in SEQ ID NO: 1 73, which contains 
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a potential open reading frame. The amino acid sequence encoded by the sequence of 
SEQ ID NO: 173 is provided in SEQ ID NO: 174. Additionally, the full-length cDNA 
sequence for the clone of SEQ ID NO: 100 (known as L523S), a known gene, is 
provided in SEQ ID NO: 175, with the corresponding amino acid sequence being 
5 provided in SEQ ID NO: 176. In further studies, a full-length cDNA sequence for 
L523S was isolated from a L523S-positive tumor cDNA library by PCR amplification 
using gene specific primers designed from the sequence of SEQ ID NO: 175. The 
determined full-length cDNA sequence is provided in SEQ ID NO: 347. The amino 
acid sequence encoded by this sequence is provided in SEQ ID NO: 348. This protein 
10 sequence differs from the previously published protein sequence at two amino acid 
positions, namely at positions 1 58 and 410. 

Comparison of the sequences of L514S and L531S (SEQ ID NO: 87 and 
88, and 109, respectively) with those in the gene bank, as described above, revealed no 
significant homologies to known sequences. The sequences of L513S, L516S, L517S, 
15 L519S, L520S and L530S (SEQ ID NO: 87 and 88, 91 and 92, 93, 94, 95 and 96, 107 
and 108, respectively) were found to show some homology to previously identified 
ESTs. The sequences of L521S, L522S, L523S, L524S, L525S, L526S, L527S, L528S 
and L529S (SEQ ID NO: 97 and 98, 99, 99, 101, 102, 103, 104, 105, and 106, 
respectively) were found to represent known genes. The determined full-length cDNA 
20 sequence for L520S is provided in SEQ ID NO: 113, with the corresponding amino 
acid sequence being provided in SEQ ID NO: 114. Subsequent microarray analysis 
showed L520S to be overexpressed in breast tumors in addition to lung squamous 
tumors. 

Further analysis demonstrated that L529S (SEQ ID NO: 106 and 115),. 

25 L525S (SEQ ID NO: 102 and 120) and L527S (SEQ ID NO: 104) are cytoskeletal 
components and potentially squamous cell specific proteins. L529S is connexin 26, a 
gap junction protein. It was found to be highly expressed in one lung squamous tumor, 
referred to as 9688T, and moderately over-expressed in two, others. However, lower 
level expression of connexin 26 is also detectable in normal skin, colon, liver and 

30 stomach. The over-expression of connexin 26 in some breast tumors has been reported 
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and a mytated form of L529S may result in over-expression in lung tumors. L525S is 
plakophilin 1, a desmosomal protein found in plaque-bearing adhering junctions of the 
skin. Expression levels for L525S mRNA was highly elevated in three out of four lung 
squamous tumors tested, and in normal skin. L527S has been identified as keratin 6 
5 isoform, type II 58 Kd keratin and cytokeratin 13, and shows over-expression in 
squamous tumors and low expression in normal skin, breast and colon tissues. Keratin 
and keratin-related genes have been extensively documented as potential markers for 
lung cancer including CYFRA2.1 (Pastor, A., et al, Eur. Respir. J., 10:603-609, 1997). 
L513S (SEQ ID NO: 87 and 88) shows moderate over-expression in several tumor 
10 tissues tested, and encodes a protein that was first isolated as a pemphigus vulgaris 
antigen, 

L520S (SEQ ID NO: 95 and 96) and L521S (SEQ ID NO: 97 and 98) are 
highly expressed in Jung squamous tumors, with L520S being up-regulated in normal 
salivary gland and L521 S being over-expressed in normal skin. Both belong to a family 

15 of small proline rich proteins and represent markers for fully differentiated squamous 
cells. L521S has been described as a specific marker for lung squamous tumor (Hu, R., 
et al, Lung Cancer, 20:25-30, 1998). L515S (SEQ ID NO: 162) encodes IGF-P2 and 
L516S is an aldose reductase homologue. Both are moderately expressed in lung 
squamous tumors and in normal colon. Notably, L5 1 6S (SEQ ID NO: 91 and 92) is up- 

20 regulated in metastatic tumors but not primary lung adenocarcinoma, an indication of its 
potential role in metatasis and a potential prognostic marker. L522S (SEQ ID NO: 99) 
is moderately over-expressed in lung squamous tumors with minimum expression in 
normal tissues. L522S has been shown to belong to a class IV alcohol dehydrogenase, 
ADH7, and its expression profile suggests it is a squamous cell specific antigen. L523S 

25 (SEQ ID NO: 100) is moderately over-expressed in lung squamous tumor, human 
pancreatic cancer cell lines and pancreatic cancer tissues, suggesting this gene may be a 
shared antigen between pancreatic and lung squamous cell cancer. 

L524S (SEQ ID NO: 101) is over-expressed injthe majority of squamous 
tumors tested and is homologous with parathyroid hormone-related peptide (PTHrP), 

30 which is best known to cause humoral hypercalcaemia associated with malignant 
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tumors such as leukemia, prostate and breast cancer. It is also believed that PTHrP is 
most commonly associated with squamous carcinoma of lung and rarely with lung 
adenocarcinoma (Davidson, LA, et al, J. Pathol, 178: 398-401, 1996). L528S (SEQ 
ID NO: 105) is highly over-expressed in two lung squamous tumors with moderate 
5 expression in two other squamous tumors, one lung adenocarcinoma and some normal 
tissues, including skin, lymph nodes, heart, stomach and lung. It encodes the NMB 
gene that is similar to the precursor of melanocyte specific gene Pmell7, which is 
reported to be preferentially expressed in low-metastatic potential melanoma cell lines. 
This suggests that L528S may be a shared antigen in both melanoma and rung squamous 
10 cell carcinoma. L526S (SEQ ID NO: 103) was overexpressed in all lung squamous cell 
tumor tissues tested and has been shown to share homology with a gene (ATM) in 
which a mutation causes ataxia telangiectasia, a genetic disorder in humans causing a 
predisposition to cancer, among other symptoms. ATM encodes a protein that activates 
a P 53 mediated cell-cycle checkpoint through direct binding and phosphorylation of the 
15 P 53 molecule. Approximately 40% of lung cancers are associated with P 53 mutations, 
and it is speculated that over-expression of ATM is a result of compensation for loss of 
p53 function, but it is unknown whether over-expression is the cause of result of lung 
squamous cell carcinoma. Additionally, expression of L526S (ATM) is also detected in 
a metastatic but not lung adenocarcinoma, suggesting a role in metastasis. 
20 Expression of L523S (SEQ ID NO: 175), was examined by real time 

RT-PCR as described above. In a first study using a panel of lung squamous tumors, 
L523S was found to be expressed in 4/7 lung squamous tumors, 2/3 head and neck 
squamous tumors and 2/2 lung adenocarcinomas, with low level expression being 
observed in skeletal muscle, soft palate and tonsil. In a second study using a lung 
25 adenocarcinoma panel, expression of L523S was observed in 4/9 primary 
adenocarcinomas, 2/2 lung pleural effusions, 1/1 metastatic lung adenocarcinomas and 
2/2 lung squamous tumors, with little expression being observed in normal tissues. 

Expression of L523S in lung tumors and various normal tissues was also 
examined by Northern blot analysis, using standard techniques. In a first study, L523S 
30 was found to be expressed in a number of lung adenocarcinomas and squamous cell 
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carcinomas, as well as normal tonsil. No expression was observed in normal lung. In a 
second study using a normal tissue blot (referred to as HB-12) from Clontech, no 
expression was observed in brain, skeletal muscle, colon, thymus, spleen, kidney, liver, 
small intestine, lung or PBMC, although there was strong expression in placenta. 

5 

EXAMPLE 3 

ISOLATION AND CHARACTERIZATION OF LUNG TUMOR POLYPEPTIDES 
BY PCR-BASED SUBTRACTION 
Eight hundred and fifty seven clones from a cDNA subtraction library, 
10 containing cDNA from a pool of two human lung squamous tumors subtracted against 
eight normal human tissue cDNAs including lung, PBMC, brain, heart, kidney, liver, 
pancreas, and skin, (Clontech, Palo Alto, CA) were derived and submitted to a first 
round of PCR amplification. This library was subjected to a second round of PCR 
amplification, following the manufacturer's protocol. The resulting cDNA fragments 
15 were subcloned into, the P7-Adv vector (Clontech, Palo Alto, CA) and transformed into 
DH5a E. coli (Gibco, BRL). DNA was isolated from independent clones and 
sequenced using a Perkin Elmer/Applied Biosystems Division Automated Sequencer 
Model 373A. 

One hundred and sixty two positive clones were sequenced. Comparison 
20 of the DNA sequences of these clones with th6se in the EMBL and GenBank databases, 
as described above, revealed no significant homologies to 13 of these clones, hereinafter 
referred to as Contigs 13, 16, 17, 19, 22, 24, 29, 47, 49, 56-59. The determined cDNA 
sequences for these clones are provided in SEQ ID NO: 125, 127-129, 131-133, 142, 
144, 148-150, and 157, respectively. Contigs 1, 3-5, 7-10, 12, 1 1, 15, 20, 31, 33, 38, 39, 
25 41, 43, 44, 45, 48, 50, 53, 54 (SEQ ID NO: 1 15-124, 126, 130, 134-141, 143, 145-147, 
respectively) were found to show some degree of homology to previously identified 
DNA sequences. Contig 57 (SEQ ID NO: 149) was found to represent the clone L5 19S 
(SEQ ID NO: 94) disclosed in US. Patent Application No. Q9/123,912, filed July 27, 
1998. To the best of the inventors* knowledge, none of these sequences have been 
30 previously shown to be differentially over-expressed in lung tumors. 
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/ mRNA expression levels for representative clones in lung tumor tissues, 
normal lung tissues (n=4), resting PBMC, salivary gland, heart, stomach, lymph nodes, 
skeletal muscle, soft palate, small intestine, large intestine, bronchial, bladder, tonsil, 
kidney, esophagus, bone marrow, colon, adrenal gland, pancreas, and skin (all derived 
5 from human) were determined by RT-PCR as described above. Expression levels using 
microarray technology, as described above, were examined in one sample of each tissue 

type unless otherwise indicated. 

Contig 3 (SEQ ID NO: 116) was found to be highly expressed in all head 
and neck squamous cell tumore tested (17/17), and expressed in the majority (8/12) of 
10 lung squamous tumors, (high expression in 7/12, moderate in 2/12, and low in 2/12), 
while showing negative expression for 2/4 normal lung tissues and low expression in 
the remaining two samples. Contig 3 showed moderate expression in skin and soft 
palate, and lowered expression levels in resting PBMC, large intestine, salivary gland, 
tonsil, pancreas, esophagus, and colon. Contig 1 1 (SEQ ID NO: 124) was found to be 
15 expressed in all head and neck squamous cell tumors tested (17/17), with high levels of 
expression being seen in 14/17 tumors, and moderately levels of expression being seen 
in 3/17 tumors. Additionally, high expression was seen in 3/12 lung squamous tumors 
and moderate expression in 4/12 lung squamous tumors. Contig 1 1 was negative for 3/4 
normal lung samples, with the remaining sample having only low expression. Contig 
20 1 1 showed low to moderate reactivity to salivary gland, soft palate, bladder, tonsil, skin, 
esophagus, and large intestine. Contig 13 (SEQ ID NO: 125) was found to be expressed 
in all head and neck squamous cell tumors tested (17/17), with high expression in 
12/17, and moderate expression in 5/17. Contig 13 was expressed in 7/12 lung 
squamous tumors, with high expression in 4/12 and moderate expression in three 
25 samples. Analysis of normal lung samples showed negative expression for 2/4 and low 
to moderate expression in the remaining two samples. Contig 13 showed low to 
moderate reactivity to resting PBMC, salivary gland, bladder, pancreas, tonsil, skin, 
esophagus, and large intestine, as well as high expression in soft palate. Subsequent 
full-length cloning efforts revealed that contig 13 (also known as L761P) maps to the 3' 
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untranslated region of the hSeclOp gene. The full-length sequence for this gene is set 
forth in SEQ ID NO: 368, and encodes the protein set forth in SEQ ID NO: 369. 

Contig 16 (SEQ ID NO: 127) was found to be moderately expressed in 
several head and neck squamous cell tumors (6/17) and one lung squamous tumor, 
5 while showing no expression in any normal lung samples tested. Contig 16 showed low 
reactivity to resting PBMC, large intestine, skin, salivary gland, and soft palate. Contig 
17 (SEQ ID NO: 128) was shown to be expressed in all head and neck squamous cell 
tumors tested (17/17) (highly expressed in 5/17, and moderately expressed in 12/17). 
Determination of expression levels in lung squamous tumors showed one tumor sample 

10 with high expression and 3/12 with moderate levels. Contig 17 was negative for 2/4 
normal lung samples, with the remaining samples having only low expression. 
Additionally, low level expression was found in esophagus and soft palate. Contig 19 
(SEQ ID NO: 129) was found to be expressed in most head and neck squamous cell 
tumors tested (11/17); with two samples having high expression levels, 6/17 showing 

15 moderate expression, and low expression being found in 3/17. Testing in lung 
squamous tumors revealed only moderate expression in 3/12 samples. Expression 
levels in 2/4 of normal lung samples were negative, the two other samples having only 
low expression. Contig 19 showed low expression levels in esophagus, resting PBMC, 
salivary gland, bladder, soft palate and pancreas. 

20 Contig 22 (SEQ ID NO: 131), Was shown to be expressed in most head 

and neck squamous cell tumors tested (13/17) with high expression in four of these 
samples, moderate expression in 6/17, and low expression in 3/17. Expression levels in 
lung squamous tumors were found to be moderate to high for 3/12 tissues tested, with 
negative expression in two normal lung samples and low expression in two other 

25 samples (n=4). Contig 22 showed low expression in skin, salivary gland and soft 
palate. Similarly, Contig 24 (SEQ ID NO: 132) was found to be expressed in most head 
and neck squamous cell tumors tested (13/17) with high expression in three of these 
samples, moderate expression in 6/17, and low expression in 4(17. Expression levels in 
lung squamous tumors were found to be moderate to high for 3/12 tissues tested, with 

30 negative expression for three normal lung samples and low expression in one sample 
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(n=4). Gontig 24 showed low expression in skin, salivary gland and soft palate. 
Contig 29 (SEQ ID NO: 133) was expressed in nearly all head and neck squamous cell 
tumors tested (16/17): highly expressed in 4/17, moderately expressed in 11/17, with 
low expression in one sample. Also, it was moderately expressed in 3/12 lung 
5 squamous tumors, while being negative for 2/4 normal lung samples. Contig 29 
showed low to moderate expression in large intestine, skin, salivary gland, pancreas, 
tonsil, heart and soft palate. Contig 47 (SEQ ID NO: 142) was expressed in most head 
and neck squamous cell tumors tested (12/17): moderate expression in 10/17, and low 
expression in two samples. In lung squamous tumors, it was highly expressed in one 
1 0 sample and moderately expressed in two others (n=l 3). Contig 47 was negative for 2/4 
normal lung samples, with the remaining two samples having moderate expression. 
Also, Contig 47 showed moderate expression in large intestine, and pancreas, and low 
expression in skin, salivaiy gland, soft palate, stomach, bladder, resting PBMC, and 
tonsil. 

15 Contig 48 (SEQ ID NO: 143) was expressed in all head and neck 

squamous cell tumors tested (17/17): highly expressed in 8/17 and moderately 
expressed in 7/17, with low expression in two samples. Expression levels in lung 
squamous tumors were high to moderate in three samples (n=13). Contig 48 was 
negative for one out of four normal lung samples, the remaining showing low or 

20 moderate expression. Contig 48 showed moderate expression in soft palate, large 
intestine, pancreas, and bladder, and low expression in esophagus, salivary gland, 
resting PBMC, and heart. Contig 49 (SEQ ID NO: 144) was expressed at low to 
moderate levels in 6/17 head and neck squamous cell tumors tested. Expression levels 
in lung squamous tumors were moderate in three samples (n=13). Contig 49 was 

25 negative for 2/4 normal lung samples, the remaining samples showing low expression. 
Moderate expression levels in skin, salivary gland, large intestine, pancreas, bladder and 
resting PBMC were shown, as well as low expression in soft palate, lymph nodes, and 
tonsil. Contig 56 (SEQ ID NO: 148) was expressed in low to moderate levels in 3/17 
head and neck squamous cell tumors tested, and in lung squamous tumors, showing low 

30 to moderate levels in three out of thirteen samples. Notably, low expression levels were 
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detected in one adenocarcinoma lung tumor sample (n=2). Contig 56 was negative for 
3/4 normal lung samples, and showed moderate expression levels in only large intestine, 
and low expression in salivary gland, soft palate, pancreas, bladder, and resting PBMC. 
Contig 58, also known as L769P, (SEQ ID NO: 150) was expressed at moderate levels 
5 in 11/17 head and neck squamous cell tumors tested and low expression in one 
additional sample. Expression in lung squamous tumors showed low to moderate levels 
in three out of thirteen samples. Contig 58 was negative for 3/4 normal lung samples, 
with one sample having low expression. Moderate expression levels in skin, large 
intestine, and resting PBMC were demonstrated, as well as low expression in salivary 
1 0 gland, soft palate, pancreas, and bladder. Contig 59 (SEQ ID NO: 157) was expressed in 
some head, neck, and lung squamous tumors. Low level expression of Contig 59 was 
also detected in salivary gland and large intestine; 

The full-length cDNA sequence for Contig 22, also referred to as L763P, 
is provided in SEQ ID NO: 158, with the corresponding amino acid sequence being 

1 5 provided in SEQ ID NO: 159. Real-time RT-PCR analysis of L763P revealed that it is 
highly expressed in 3/4 lung squamous tumors as well as 4/4 head and neck squamous 
tumors, with low level expression being observed in normal brain, skin, soft pallet and 
trachea. Subsequent database searches revealed that the sequence of SEQ ID NO: 158 
contains a mutation, resulting in a frameshift in the corresponding protein sequence. A 

20 second cDNA sequence for L763P is provided in SEQ ID NO: 345, with the 
corresponding amino acid sequence being provided in SEQ ID NO: 346. The sequences 
of SEQ ID NO: 1 59 and 346 are identical with the exception of the C-terminal 33 amino 
acids of SEQ ID NO: 1 59. 

The full-length cDNA sequence incorporating Contigs 17, 19, and 24, 

25 referred to as L762P, is provided in SEQ ID NO: 160, with the corresponding amino 
acid sequence being provided in SEQ ID NO: 161. Further analysis of L762P has 
determined it to be a type I membrane protein and two additional variants have been 
sequenced. Variant 1 (SEQ ID NO: 167, with the corresponding amino acid sequence 
in SEQ ID NO: 169) is an alternatively spliced form of SEQ ID NO: 160 resulting in 

30 deletion of 503 nucleotides, as well as deletion of a short segment of the expressed 
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protein, yariant 2 (SEQ ID NO: 168, with the corresponding amino acid sequence in 
SEQ ID NO: 170) has a two nucleotide deletion at the 3' coding region in comparison 
to SEQ ID NO; 160, resulting in a secreted form of the expressed protein. Real-time 
RT-PCR analysis of L762P revealed that is over-expressed in 3/4 lung squamous 
5 tumors and 4/4 head & neck tumors, with low level expression being observed in 
normal skin, soft pallet and trachea. 

An epitope of L762P was identified as having the sequence 
KPGHWTYTLNNTHHSLQALK (SEQ ID NO: 382), which corresponds to amino 
acids 571-590 of SEQ ID NO:16L 

10 The full-length cDNA sequence for contig 56 (SEQ ID NO: 148), also 

referred to as L773P, is provided in SEQ ID NO: 171, with the amino acid sequence in 
SEQ ID NO: 172. L773P was found to be identical to dihydroxyl dehydrogenase at the 
3' portion of the gene, with divergent 5* sequence. As a result, the 69 N-terminal amino 
acids are unique. The cDNA sequence encoding the 69 N-terminal amino acids is 

15 provided in SEQ ID NO: 349, with the N-terminal amino acid sequence being provided 
in SEQ ID NO: 350. Real-time PCR revealed that L773P is highly expressed in lung 
squamous tumor and lung adenocarcinoma, with no detectable expression in normal 
tissues. Subsequent Northern blot analysis of L773P demonstrated that this transcript is 
differentially over-expressed in squamous tumors and detected at approximately 1.6 Kb 

20 in primary lung tumor tissue and approximately 1 .3 Kb in primary head and neck tumor 
tissue. 

Subsequent microarray analysis has shown Contig 58, also referred to as 
L769S (SEQ ID NO: 150), to be overexpressed in breast tumors in addition to lung 
squamous tumors. 

25 

EXAMPLE 4 

ISOLATION AND CHARACTERIZATION OF LUNG TUMOR POLYPEPTIDES 
BY PCR-BASED SUBTRACTION 
Seven hundred and sixty clones from a cDNA subtraction library, 
30 containing cDNA from a pool of two human lung primary adenocarcinomas subtracted 
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against a pool of nine normal human tissue cDNAs including skin, colon, lung, 
esophagus, brain, kidney, spleen, pancreas and liver, (Clontech, Palo Alto, CA) were 
derived and submitted to a first round of PCR amplification. This library (referred to as 
ALT-1) was subjected to a second round of PCR amplification, following the 
5 manufacturer's protocol. The expression levels of these 760 cDNA clones in lung 
tumor, normal lung, and various other normal and tumor tissues, were examined using 
microarray technology (Incyte, Palo Aho, CA). Briefly, the PCR amplification products 
were dotted onto slides in an array format, with each product occupying a unique 
location in the array. mRNA was extracted from the tissue sample to be tested, reverse 

10 transcribed, and fluorescent-labeled cDNA probes were generated. The microarrays 
were probed with the labeled cDNA probes, the slides scanned and fluorescence 
intensity was measured. This intensity correlates with the hybridization intensity.. A 
total of 118 clones, of which 55 were unique, were found to be over-expressed in lung 
tumor tissue, with expression in normal tissues tested (lung, skin, lymph node, colon, 

15 liver, pancreas, breast, heart, bone marrow, large intestine, kidney, stomach, brain, small 
intestine, bladder and salivary gland) being either undetectable, or at significantly lower 
levels. One of these clones, having the sequence as provided in SEQ ID NO:420 (clone 
#19014), shows homology to a previously identified clone, L773P. Clone L773P has 
the full-length cDNA sequence provided in SEQ ID NO: 171 and the amino acid 

20 sequence provided in SEQ ID NO:172 The isolation of clone #19014 is also described 
in co-pending U.S. Patent application 09/285,479, filed April 2, 1999. 

EXAMPLES 
SYNTHESIS OF POLYPEPTIDES 
25 Polypeptides may be synthesized on a Perkin Elmer/Applied Biosystems 

Division 430A peptide synthesizer using FMOC chemistry with HPTU (O- 
Benzolriazole-NJsfJSr^^tetramethyluronium hexafluorophosphate) activation. A Gly- 
Cys-Gly sequence may be attached to the amino terminus oi; the peptide to provide a 
method of conjugation, binding to an immobilized surface, or labeling of the peptide. 
30 Cleavage of the peptides from the solid support is carried out using the following 



130 



WO 02/00174 



PCT/US01/21065 



cleavage mixture: trifluoroacetic acid:ethanedithiol^hioanisolerwater:phenol 
(40:1:2:2:3). After cleaving for 2 hours, the peptides are precipitated in cold methyl-t- 
butyl-ether. The peptide pellets are then dissolved in water containing 0.1% 
trifluoroacetic acid (TFA) and lyophilized prior to purification by CI 8 reverse phase 
5 HPLC. A gradient of 0%-60% acetonitrile (containing 0. 1% TFA) in water (containing 
0.1% TFA) may be used to elute the peptides. Following lyophilization of the pure 
fractions, the peptides are characterized using electrospray or other types of mass 
spectrometry and by amino acid analysis. 

10 EXAMPLE 6 

PREPARATION OF ANTIBODIES AGAINST LUNG CANCER ANTIGENS 

Polyclonal antibodies against the lung cancer antigens L514S, L528S, 
L531S, L523 and L773P (SEQ ID NO: 155, 225, 112, 176 and 171, respectively) were 
prepared as follows. 

15 Rabbits were immunized with recombinant protein expressed in and 

purified from E. colt as described below. For the initial immunization, 400 jig of 
antigen combined with muramyl dipeptide (MDP) was injected subcutaneously (S.C.). 
Animals were boosted S.C. 4 weeks later with 200 \xg of antigen mixed with incomplete 
Freund's Adjuvant (IFA). Subsequent boosts of 100 fig of antigen mixed with BFA 

20 were injected S.C. as necessary to induce higjh antibody titer responses. Serum bleeds 
from immunized rabbits were tested for antigen-specific reactivity using ELISA assays 
with purified protein. Polyclonal antibodies against L514S, L528S, L531S, L523S and 
L773P were affinity purified from high titer polyclonal sera using purified protein 
attached to a solid support. 

25 Immunohistochemical analysis using polyclonal antibodies against 

L5 14S was performed on a panel of 5 lung tumor samples, 5 normal lung tissue samples 
and normal colon, kidney, liver, brain and bone marrow. Specifically, tissue samples 
were fixed in formalin solution for 24 hours and embedded in paraffin before being 
sliced into 10 micron sections. Tissue sections were permeabilized and incubated with 

30 antibody for 1 hr. HRP-labeled anti-mouse followed by incubation with DAB 
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chromogen was used to visualize L514S immunoreactivity. L514S was found to be 
highly expressed in lung tumor tissue with little or no expression being observed in 
normal lung, brain or bone marrow. Light staining was observed in colon (epithelial 
crypt cells positive) and kidney (tubules positive). Staining was seen in normal liver but 
5 no mRNA has been detected in this tissue making this result suspect. 

Using the same procedure, immunohistochemical analysis using 
polyclonal antibodies against L528S demonstrated staining in lung tumor and normal 
lung samples, light staining in colon and kidney, and no staining in liver and heart 

Immunohistochemical analysis using polyclonal antibodies against 

10 L531S demonstrated staining in lung tumor samples, light membrane staining in most 
normal lung samples, epithelial staining in colon, tubule staining in kidney, ductal 
epithelial staining in liver and no staining in heart. 

Immunohistochemical analysis using polyclonal antibodies against 
L523S demonstrated staining in all lung cancer samples tested but no staining in normal 

1 5 lung, kidney, liver, colon, bone marrow or cerebellum. 

Generation of polyclonal anti-sera against L762P (SEQ ID NO: 169 and 
170) was performed as follows. 400 micrograms of lung antigen was combined with 
100 micrograms of muramyldipeptide (MDP). An equal volume of Incomplete 
Freund's Adjuvant (IFA) was added and then mixed until an emulsion was formed. 

20 Rabbits were injected subcutaneously (S.C.). After four weeks the animals were 
injected S.C. with 200 micrograms of antigen mixed with an equal volume of IFA. 
Every four weeks animals were boosted with 100 micrograms of antigen. Seven days 
following each boost the animal was bled. Sera was generated by incubating the blood 
at 4°C for 12-24 hours followed by centrifugation. 

25 Characterization of polyclonal antisera was carried out as follows. 

Ninety-six well plates were coated with antigen by incubing with 50 microliters 
(typically 1 microgram) at 4°C for 20 hrs. 250 microliters of BSA blocking buffer was 
added to the wells and incubated at room temperature for 2 fys. Plates were washed 6 
times with PBS/0.01% Tween. Rabbit sera was diluted in PBSand 50 microliters of 

30 diluted sera was added to each well and incubated at room temperature for 30 min. 
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Plates wcjre washed as described above before addition of 50 microliters of goat anti- 
rabbit horse radish peroxidase (HRP) at a 1:10000 dilution and incubation at room 
temperature for 30 min. Plates were washed as described above and lOOjal of TMB 
Micro well Peroxidase Substrate was added to each well. Following a 15 minute 
5 incubation in the dark at room temperature, the colorimetric reaction was stopped with 
lOOjxl IN H2SO4 and read immediately at 450 nm. Antisera showed strong reactivity to 
antigen L762P. 

Immunohistochemical analysis using polyclonal antibodies against 
L762P demonstrated staining in all lung cancer samples tested, some light staining in 

10 the bronchiole epithelium of normal lung, tubule staining in kidney, light epithelial 
staining in colon and no staining in heart or liver. 

In order to evaluate L773P protein expression in various tissues, 
immunohistochemistry (IHC) analysis was performed using an affinity purified L773P 
polyclonal antibody. Briefly, tissue samples were fixed in formalin solution for 12-24 

15 hrs and embedded in paraffin before being sliced into 8 micron sections. Steam heat 
induced epitope retrieval (SHIER) in 0.1 M sodiuym citrate buffer (pH 6.0) was used 
for optimal staining conditions. Sections were incubated with 10% serum/PBS for 5 
minutes. Primary antibody was added to each section for 25 minutes at indicated 
concentrations followed by 25 minute incubation with either anti-rabbit or anti-mouse 

20 biotinylated antibody. Endogenous peroxidase activitiy was blocked by three 1 .5 minute 
incubations with hydrogen peroxidase. Hie avidin biotin complex/horse radish 
peroxidase (ABC/HRP) system was used along with DAB chromogen to visualize 
L773P expression. Slides were counterstainied with hematoxylin to visualize cell 
nuclei. Using this approach, L773P protein was detected in 6/8 lung tumors, 4/6 normal 

25 lung samples (very light staining in some cases), 1/1 kidney samples (very light 
staining), 0/1 heart samples, 1/1 colon samples (very light staining) and 0/1 liver 
samples. 
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EXAMPLE 7 

PEPTIDE PRIMING OF MICE AND PROPAGATION OF CTL LINES 

Immunogenic peptides from the lung cancer antigen L762P (SEQ ID 
5 NO: 161) for HLA-A2/K b -restricted CD8+ T cells were identified as follows. 

The location of HLA-A2 binding peptides within the lung cancer antigen 
L762P (SEQ ID NO: 161) was predicted using a computer program which predicts 
peptides sequences likely to being to HLA-A*0201 by fitting to the known peptide 
binding motif for HLA-A*0201 (Rupert et al (1993) Cell 74:929; Rammensee et al 
10 (1995) Immunogenetics 47:178-228). A series of 19 synthetic peptides corresponding 
to a selected subset of the predicted HLA-A*0201 binding peptides was prepared as 
described above. 

Mice expressing the transgene for human HLA A2/K b (provided by Dr L. 
Sherman, The Scripps Research Institute, La Jolla, CA) were immunized with the 

15 synthetic peptides, as described by Theobald et al., Proc. Natl Acad Sci. USA 
92:1 1993-1 1997, 1995, with the following modifications. Mice were immunized with 
50jig of L726P peptide and 120[ig of an I-A b binding peptide derived from hepatitis B 
virus protein emulsified in incomplete Freund's adjuvant Three weeks later these mice 
were sacrificed and single cell suspensions prepared. Cells were then resuspended at 7 

20 x 10 6 cells/ml in complete media (RPMH640; Gibco BRL, Gaithersburg, MD) 
containing 10% FCS, 2mM Glutamine (Gibco BRL), sodium pyruvate (Gibco BRL), 
non-essential amino acids (Gibco BRL), 2 x 10" 5 M 2-mercaptoethanol, 50U/ml 
penicillin and streptomycin, and cultured in the presence of irradiated (3000 rads) 
. L762P peptide- (5jig/ml) and lOmg/ml B2-microglobulin- (3 fxg/ml) LPS blasts (A2 

25 transgenic spleens cells cultured in the presence of 7jxg/ml dextran sulfate and 25|ig/ml 
LPS for 3 days). After six days, cells (5 x 10 5 /ml) were restimulated with 2.5 x 10 6 /ml 
peptide-pulsed irradiated (20,000 . rads) EL4A2Kb cells (Sherman et al, Science 
255:815-818, 1992) and 5 x 10 6 /ml irradiated (3000 rads) A2/K b -transgenic spleen 
feeder cells. Cells were cultured in the presence of lOU/ml IL-2. Cells were 

30 restimulated on a weekly basis as described, in preparation for cloning the line. 
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Peptide-specific cell lines were cloned by limiting dilution analysis with 
irradiated (20,000 rads) L762P peptide-pulsed EL4 A2Kb tumor cells (1 x 10 4 
cells/well) as stimulators and irradiated (3000 rads) A2/K b -transgenic spleen cells as 
feeders (5 x 1 0 5 cells/ well) grown in the presence of lOU/ml IL-2. On day 7, cells were 
5 restimulated as before. On day 14, clones that were growing were isolated and 
maintained in culture. 

Cell lines specific for the peptides L762P-87 (SEQ ID NO: 226; 
corresponding to amino acids 87-95 of SEQ ID NO: 161), L762P-145 (SEQ ID NO: 
227; corresponding to amino acids 145-153 of SEQ ID NO: 161), L762P-585 (SEQ ID 

10 NO: 228; corresponding to amino acids 585-593 of SEQ ID NO: 161), L762P-425 
(SEQ ID NO: 229; corresponding to amino acids 425-433 of SEQ ID NO: 161), 
L762P(10)-424 (SEQ ID NO: 230; corresponding to amino acids 424-433 of SEQ ID 
NO: 161) and L762P(10>458 (SEQ ID NO: 23 1; corresponding to amino acids 458-467 
of SEQ ID NO: 161) demonstrated significantly higher reactivity (as measured by 

15 percent specific lysis) against L762P peptide-pulsed EL4-A2/K b tumor target cells than 
control peptide-pulsed EL4-A2/K b tumor target cells. 

EXAMPLE 8 

IDENTIFICATION OF CD4 IMMUNOGENIC T CELL EPITOPES DERIVED FROM 
20 THE LUNG CANCER ANTIGEN L762P 

CD4 T cell lines specific for the antigen L762P (SEQ ID NO: 161) were 

generated as follows. 

A series of 28 overlapping peptides were synthesized that spanned 

approximately 50% of the L762P sequence. For priming, peptides were combined into 
25 pools of 4-5 peptides, pulsed at 20 micrograms/ml into dendritic cells for 24 hours. The 

dendritic cells were then washed and mixed with positively selected CD4+ T cells in 96 

well U-bottomed plates. Forty cultures were generated for each peptide pool. Cultures 

were restimulated weekly with fresh dendritic cells loaded with peptide pools. 

Following a total of 3 stimulation cycles, cells were rested for an additional week and 
30 tested for specificity to antigen presenting cells (APC) pulsed with peptide pools using 
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interferon-gamma ELISA and proliferation assays. For these assays, adherent 
monocytes loaded with either the relevant peptide pool or an irrelevant peptide were 
used as APC. T cell lines that appeared to specifically recognize L762P peptide pools 
both by cytokine release and proliferation were identified for each pool. Emphasis was 
5 placed on identifying T cells with proliferative responses. T cell lines that demonstrated 
either both L762P-specific cytokine secretion and proliferation, or strong proliferation 

t 

alone were further expanded to be tested for recognition of individual peptides from the 
pools, as well as for recognition of recombinant L762P. The source of recombinant 
L762P was E. coli, and the material was partially purified and endotoxin positive. 
10 These studies employed 10 micrograms of individual peptides, 10 or 2 micrograms of 
an irrelevant peptide, and 2 or 0.5 micrograms of either L762P protein or an irrelevant, 
equally impure, E. coli generated recombinant protein. Significant interferon-gamma 
production and CD4 T cell proliferation was induced by a number of L762P-derived 
peptides in each pool. The amino acid sequences for these peptides are provided in 

15 SEQ ID NO: 232-251. These peptides correspond to amino acids 661-680, 676-696, 
526-545, 874-893, 811-830, 871-891, 856-875, 826-845, 795-815, 736-755, 706-725, 
706-725, 691-710, 601-620, 571-590, 556-575, 616-635, 646-665, 631-650, 541-560 
and 586-605, respectively, of SEQ ID NO: 161 . 

CD4 T cell lines that demonstrated specificity for individual L762P- 

20 derived peptides were further expanded by stimulation with the relevant peptide at 10 
micrograms/ml. Two weeks post-stimulation, T cell lines were tested using both 
proliferation and IFN-gamma ELISA assays for recognition of the specific peptide. A 
number of previously identified T cells continued to demonstrate L762P-peptide 
specific activity. Each of these lines was further expanded on the relevant peptide and, 

25 following two weeks of expansion, tested for specific recognition of the L762P-peptide 
in titration experiments, as well as for recognition of recombinant E cofr-derived L762P 
protein. For these experiments, autologous adherent monocytes were pulsed with either 
the relevant L762P-derived peptide, an irrelevant mammaglobin-derived peptide, 
recombinant £ co/i-derived L762P (approx. 50% pure), or an irrelevant E. co//-derived 

30 protein. The majority of T cell lines were found to show low affinity for the relevant 
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peptide, since specific proliferation and IFN-gamma ratios dramatically decreased as 
L762P peptide was diluted. However, four lines were identified that demonstrated 
significant activity even at 0.1 micrograms/ml peptide. Each of these lines (referred to 
as A/D5, D/F5, E/A7 and E/B6) also appeared to specifically proliferate in response to 
5 the E. co//-derived L762P protein preparation, but not in response to the irrelevant 
protein preparation. The amino acid sequences of the L762P-derived peptides 
recognized by these lines are provided in SEQ ID NO: 234, 249, 236 and 245, 
respectively. No protein specific IFN-gamma was detected for any of the lines. Lines 
A/D5, E/A7 and E/B6 were cloned on autologous adherent monocytes pulsed with the 
10 relevant peptide at 0.1 (A/D5 and E/A7) or 1 (D/F5) microgram/ml. Following growth, 
clones were tested for specificity for the relevant peptide. Numerous clones specific for 
the relevant peptide were identified for lines A/D5 and E/A7. 

EXAMPLE 9 

15 PROTEIN EXPRESSION OF LUNG TUMOR-SPECIFIC ANTIGENS 

a) Expression of L514S in K coli 

The lung tumor antigen L514S (SEQ ID NO: 89) was subcloned into the 
expression vector pE32b at Ncol and NotI sites, and transformed into K coli using 
standard techniques. The protein was expressed from residues 3-153 of SEQ ID NO: 
20 89. The expressed amino acid sequence and the corresponding DNA sequence are 
provided in SEQ ID NO: 252 and 253, respectively. 

b) Expression of L762P 

Amino acids 32-944 of the lung tumor antigen L762P (SEQ ID NO: 161), with a 6X His 
Tag, were subcloned into a modified pET28 expression vector, using kanamycin 
25 resistance, and transformed into BL21 CodonPlus using standard techniques. Low to 
moderate levels of expression were observed. The determined DNA sequence of the 
L762P expression construct is provided in SEQ ID NO: 254. 
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EXAMPLE 10 

IDENTIFICATION OF MHC CLASS II RESTRICTING ALLELE FOR L762P 
PEPTPE-SPECIFIC RESPONSES 
5 A panel of HLA mismatched antigen presenting cells (APC) were used 

to identify the MHC class II restricting allele for the L762P-peptide specific responses 
of CD4 T cell clones derived from lines that recognized L762P peptide and recombinant 
protein. Clones from two lines, AD-5 and EA-7, were tested as described below. The 
AD-5 derived clones were found to be restricted by the HLA-DRB-1 101 allele, and an 

10 EA-7 derived clone was found to be restricted by the HLA DRB-0701 or DQB1-0202 
allele. Identification of the restriction allele allows targeting of vaccine therapies using 
the defined peptide to individuals that express the relevant class II allele. Knowing the 
relevant restricting allele will also enable clinical monitoring for responses to the 
defined peptide since only individuals that express the relevant allele will be monitored. 

15 CD4 T cell clones derived from line AD-5 and EA-7 were stimulated on 

autologous APC pulsed with the specific peptide at 10 *ig/ml, and tested for recognition 
of autologous APC (from donor D72) as well as against a panel of APC partially 
matched with D72 at class II alleles. Table 2 shows the HLA class typing of the APC 
tested. Adherent monocytes (generated by 2 hour adherence) from four different 

20 donors, referred to as D45, D187, D208, and D326, were used as APC in these 
experiments. Autologous APC were not included in the experiment. Each of the APC 
were pulsed with the relevant peptide (5a for AD-5 and 3e for 3A-7) or the irrelevant 
mammoglobin peptide at 10 fig/ml, and cultures were established for 10,000 T cells and 
about 20,000 APC/well. As shown in Table 3, specific proliferation and cytokine 

25 production could be detected only when partially matched donor cells were used as 
APC. Based on the MHC typing analysis, these results strongly suggest that the 
restricting allele for the L762-specific response of the AD-5 derived clones is HLA- 
DRB-1 101 and for the EA-7 derived clone the restricting allele is HLA DRB-0701 or 
DQB1-0202. 
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Table 2 - HLA Typing of APC 



DONOR 


DR 


DR 


DQ 


DQ 


D72 


Bl-1101 


Bl-0701 


Bl-0202 


Bl-0301 


D45 


-3 


-15 


Bl-0201 


Bl-0602 


D187 


-4 


-15 


-1 


-7 


D208 


Bl-1101 


Bl-0407 


-3 


-3 


D326 


B 1-0301 


Bl-0701 


Bl-0202 


Bl-0201 



1 
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EXAMPLE 11 

FUSION PROTEINS OF N-TERMDSFAL AND C-TERMINAL PORTIONS OF L763P 
In another embodiment, a Mycobacterium tuberculosis-derived 
polynucleotide, referred to as Ral2, is linked to at least an immunogenic portion of a 
5 polynucleotide of this invention. Ral2 compositions and methods for their use in 
enhancing expression of heterologous polynucleotide sequences are described in U.S. 
Patent Application 60/158,585, the disclosure of which is incorporated herein by 
reference in its entirety. Briefly, Ral2 refers to a polynucleotide region that is a 
subsequence of a Mycobacterium tuberculosis MTB32A nucleic acid. MTB32A is a 

10 serine protease of 32 KD molecular weight encoded by a gene in virulent and avirulent 
strains of M tuberculosis. The nucleotide sequence and amino acid sequence of 
MTB32A have been described (for example, U.S. Patent Application 60/158,585; see 
also, Skeiky et al. y Infection and Immun. (1999) 67:3998-4007, incorporated herein by 
reference). Surprisingly, it was discovered that a 14 KD C-terminal fragment of the 

15 MTB32A coding sequence expresses at high levels on its own and remains as a soluble 
protein throughout the purification process. Moreover, this fragment may enhance the 
immunogenicity of heterologous antigenic polypeptides with which it is fused. This 14 
KD C-terminal fragment of the MTB32A is referred to herein as Ral2 and represents a 
fragment comprising some or all of amino acid residues 192 to 323 of MTB32A. 

20 Recombinant nucleic acids 1 which encode a fusion polypeptide 

comprising a Ral2 polypeptide and a heterologous lung tumor polypeptide of interest, 
can be readily constructed by conventional genetic engineering techniques. 
Recombinant nucleic acids are constructed so that, preferably, a Ral2 polynucleotide 
sequence is located 5* to a selected heterologous lung tumor polynucleotide sequence. 

25 It may also be appropriate to place a Ral2 polynucleotide sequence 3' to a selected 
heterologous polynucleotide sequence or to insert a heterologous polynucleotide 
sequence into a site within a Ral2 polynucleotide sequence. 

In addition, any suitable polynucleotide that encodes a Ral2 or a portion 
or other variant thereof can be used in constructing recombinant fusion polynucleotides 

30 comprising Ral2 and one or more lung tumor polynucleotides disclosed herein. 
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Preferred Ral2 polynucleotides generally comprise at least about 15 consecutive 
nucleotides, at least about 30 nucleotides, at least about 60 nucleotides, at least about 
100 nucleotides, at least about 200 nucleotides, or at least about 300 nucleotides that 
encode a portion of a Ral2 polypeptide. 
5 Ral2 polynucleotides may comprise a native sequence (i.e., an 

endogenous sequence that encodes a Ral2 polypeptide or a portion thereof) or may 
comprise a variant of such a sequence. Ral2 polynucleotide variants may contain one 
or more substitutions, additions, deletions and/or insertions such that the biological 
activity of the encoded fusion polypeptide is not substantially diminished, relative to a 

1 0 fusion polypeptide comprising a native Ral2 polypeptide. Variants preferably exhibit at 
least about 70% identity, more preferably at least about 80% identity and most 
preferably at least about 90% identity to a polynucleotide sequence that encodes a native 
Ral2 polypeptide or a portion thereof. 

Two specific embodiments of fusions between Ral2 and antigens of the 

1 5 present invention are described in this example. 

A. N-Terminal Portion of L763P 

A fusion protein of full-length Ral2 and the N-terminal portion of L763P (referred to as 
L763P-N; amino add residues 1-130 of SEQ ID NO: 159) was expressed as a single 
recombinant protein in E. coli. The cDNA for the N-terminal portion was obtained by 

20 PCR with a cDNA for the full length L763P and primers L763F3 (5' 
CGGCGAATTCATGGATTGGGGGACGCTGC; SEQ ID NO: 383) and 1763RV3 (5' 
CGGCCTCGAGTCACCCCTCTATCCGAACCTTCTGC; SEQ ID NO: 384). The 
PCR product with expected size was recovered from agarose gel, digested with 
restriction enzymes EcoRI and Xhol, and cloned into the corresponding sites in the 

25 expression vector pCRXl. The sequence for the fusion of full-length of Ral2 and 
L763P-N was confirmed by DNA sequencing. The determined cDNA sequence is 
provided in SEQ ID NO:351, with the corresponding amino acid sequence being 
provided in SEQ ID NO: 352). 
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B. C-Terminal Portion of L763P 

— — — ■ 

A fusion protein of fall-length Ral2 and the C-terminal portion of L763P 
(referred to as L763P-C; amino acid residues 100-262 of SEQ ID NO: 159) was 
expressed as a single recombinant protein in E colL The cDNA of the C-terminal 
5 portion of L763P was obtained by PGR with a cDNA for the fall length of L763P and 
primers L763F4 (5' CGGCGAATTCCACGAACCACTCGCAAGTTCAG; SEQ ID 
NO: 385) and L763RV4 (5* CGGCTCGAG-TTAGCTTGGGCCTGTGATTGC; Se($ 
ID NO: 386). The PCR product with expected size was recovered from agarose gel, 
digested with restriction enzymes EcoRI and Xhol, and cloned into the corresponding 
10 sites in the expression vector pCRXl, The sequence for the fusion of fall-length Ral2 
and L763P-C was confirmed by DNA sequencing. The determined DNA sequence is 
provided in SEQ ID NO:353, with the corresponding amino acid sequence being 
provided in SEQ ID NO: 354. 

The recombinant proteins described in this example are useful for the 
15 preparation of vaccines, for antibody therapeutics, and for diagnosis of lung tumors. 

EXAMPLE 12 

EXPRESSION IN K CPU OF L762P HIS TAG FUSION PROTEIN 
20 PCR was performed on the L762P coding region with the following 

primers: 

Forward primer starting at amino acid 32. 

PDM-278 . 5'ggagtacagcttcaagacaatggg 3* (SEQ ID NO:355) Tm 57°C. 
Reverse primer including natural stop codon after amino acid 920, 
25 creating EcoRI site 

PDM-280 S'ccatgggaattcattataataattttgttcc 3' (SEQ ID NO:356) 

TM55°C. 

The PCR product was digested with EcoRI restriction enzyme, gel 
purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
30 frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
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correct cpnstruct was confirmed by DNA sequence analysis and then transformed into 
BL21 (DE3) pLys S and BL21 (DE3) CodonPlus RIL expression hosts. 

The protein sequence of expressed recombinant L762P is shown in SEQ 
ID NO:357, and the DNA sequence is shown in SEQ ID NO:358. 

5 

EXAMPLE 13 

EXPRESSION IN & COLI OF A L773PA HIS TAG FUSION PROTEIN 

The L773PA coding region (encoding amino acids 2-7 1 of SEQ ID NO: 
10 1 72) was PCR amplified using the following primers: 

Forward primer for L773PA starting at amino acid 2: 

PDM-299 5'tggcagcccctcttcttcaagtggc 3' (SEQ ID NO:359) Tm63°C. 

Reverse primer for L773PA creating artificial stop codon after amino 

acid 70: 

15 PDM-355 5'cgccagaattcatcaaacaaatctgttagcacc 3' (SEQ ID NO:360) 

Tm62°C. 

The resulting PCR product was digested with EcoRI restriction enzyme, 
gel purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
20 correct construct was confirmed by DNA sequence analysis and transformed into BL21 
(DE3) pLys S and BL21 (DE3) CodonPlus RIL expression hosts. 

The protein sequence of expressed recombinant L773PA is shown in 
SEQ ID NO:361, and the DNA sequence is shown in SEQ ID NO:362. 

25 

EXAMPLE 14 

IDENTIFICATION OF EPITOPES DERIVED FROM LUNG TUMOR SPECIFIC 

POLYPEPTIDES 

A series of peptides from the L773P amino acid sequence (SEQ ID NO: 
30 172) were synthesized and used in in vitro priming experiments to generate peptide- 
specific CD4 T cells. These peptides were 20-mers that overlapped by 15 amino acids 
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and corresponded to amino acids 1-69 of the L773P protein. This region has been 
demonstrated to be tumor-specific. Following three in vitro stimulations, CD4 T cell 
lines were identified that produced EFNy in response to the stimulating peptide but not 
the control peptide. Some of these T cell lines demonstrated recognition of recombinant 
5 L773P and L773PA (tumor-specific region) proteins. 

To perform the experiments, a total of eleven 20-mer peptides (SEQ ID 
NOs: 363, 365 and 387-395) overlapping by 15 amino acids and derived from the N- 
terminal tumor-specific region of L773P (corresponding to amino acids 1-69 of SEQ ID 
NO:172) were generated by standard procedures. Dendritic cells were derived from 

1 0 PBMC of a normal donor using GMCSF and IL-4 by standard protocol. Purified CD4 T 
cells were generated from the same donor as the dendritic cells using MACS beads and 
negative selection of PBMCs. Dendritic cells were pulsed overnight with the individual 
20-mer peptides at a concentration of 10 ng/ml. Pulsed dendritic cells were washed and 
plated at 1 x 10 4 Avell of a 96-well U-bottom plates, and purified CD4 cells were added 

15 at 1 x 10 5 well. Cultures were supplemented with 10 ng/ml IL-6 and 5 ng/ml IL-12, and 
incubated at 37°C. Cultures were re-stimulated as above on a weekly basis using as 
APC dendritic cells generated and pulsed as above, supplemented with 5 ng/ml YL-7 and 
10 jig/ml IL-2. Following 3 in vitro stimulation cycles, cell lines (each corresponding to 
one well) were tested for cytokine production in response to the stimulating peptide vs. 

20 an irrelevant peptide. i 

A small number of individual CD4 T cell lines (9/528) demonstrated 
cytokine release (JFNy) in response to the stimulating peptide but not to control peptide. 
The CD4 T cell lines that demonstrated specific activity were restimulated on the 
appropriate L773P peptide and reassayed using autologous dendritic cells pulsed with 

25 10 fig/ml of the appropriate L773P peptide, an irrelevant control peptide, recombinant 
L773P protein (amino acids 2-364, made in E. coli), recombinant L773PA (amino acids 
2-71, made in E. coli), or an appropriate control protein (L3E, made in K coli). Three 
of the nine lines tested (1-3C, 1-6G, and 4-12B) recognized the appropriate L773P 
peptide as well as recombinant L773P and L773PA. Four of the lines tested (4-8 A, 4- 

30 8E, 4-12D, and 4-12E) recognized the appropriate L773P peptide only. Two of the 
lines tested (5-6F and 9-3 B) demonstrated non-specific activity. 

145 



WO 02/00174 



PCT/US01/21065 



These results demonstrate that the peptide sequences 
MWQPLFFKWLLSCCPGSSQI (amino acids 1-20 of SEQ ID NO: 172; SEQ ID 
NO:363) and GSSQIAAAASTQPEDDINTQ (amino acids 16-35 of SEQ ID NO: 172; 
SEQ ID NO: 365) may represent naturally processed epitopes of L773P, which are 
5 capable of stimulating human class II MHC-restricted CD4 T cell responses. 

In subsequent studies, the above epitope mapping experiment was 
repeated using a different donor. Again, some of the resulting T cell lines were found to 
respond to peptide and recombinant protein. An additional peptide was found to be 
naturally processed. Specifically, purified CD4 cells were stimulated on a total of 

10 eleven 20-mer peptides overlapping by 1 5 amino acids (SEQ ID NO: 363, 387, 388, 365 
and 389-395, respectively). The priming was carried out as described above, except that 
a peptide concentration of 0.5 ug/mL rather than 10 ug/mL was employed. In the initial 
screen of the cell lines 9 of the 528 lines released at least a three-fold greater level of 
EFN-gamma with stimulating peptide vs. control peptide. These 9 lines were 

1 5 restimulated on the appropriate peptide and then tested on dendritic cells pulsed with a 
titration of appropriate peptide (10 ug/mL, 1 ugfaaL and 0.1 ug/mL), and 10 ug/mL of a 
control peptide. Six of the 9 lines recognized recombinant L773P as well as peptide. 
The six lines referred to as 1-1E, 1-2E, 1-4H, 1-6A, 1-6G and 2-12B recognized 
L773PA and the appropriate peptide. These results demonstrate that the peptides of 

20 SEQ ID NO: 363 and 387 represent naturally processed epitopes of L773P. 

Using the procedures described above, CD4+ T cell responses were 
generated from PBMC of normal donors using dendritic cells pulsed with overlapping 
20rmer peptides (SEQ ID NO: 396-419) spanning the L523S polypeptide sequence 
(SEQ ID NO: 176). A number of CD4+ T cells demonstrated reactivity with the 

25 priming peptides as well as with L523S recombinant protein, with the dominant 
reactivity of these lines being within the peptides 4, 7 and 21 (SEQ ID NO: 399, 402 
and 416; corresponding to amino acids 30-39, 60-79 and 200-219, respectively, of SEQ 
ID NO: 176). 

Epitopes within the scope of the invention include epitopes restricted by 
30 other class II MHC molecules. In addition, variants of the peptide can be produced 
wherein one or more amino acids are altered such that there is no effect on the ability of 
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the peptides to bind to MHC molecules, no effect on their ability to elicit T cell 
responses, and no effect on the ability of the elicited T cells to recognize recombinant 
protein. 

5 

EXAMPLE 15 
SURFACE EXPRESSION OF L762P AND 
ANTIBODY EPITOPES THEREOF 
Rabbits were immunized with full-length histidine-tagged L762P protein 

10 generated in E. colt Sera was isolated from rabbits and screened for specific 
recognition of L762P in ELISA assays. One polyclonal serum, referred to as 2692L, was 
identified that specifically recognized recombinant L762P protein. The 2692L anti- 
L762P polyclonal antibodies were purified from the serum by affinity purification using 
L762P affinity columns. Although L762P is expressed in a subset of primary lung tumor 

15 samples, expression appears to be lost in established lung tumor cell lines. Therefore, 
to characterize surface expression of L762P, a retrovirus construct that expresses L762P 
was used to transduce primary human fibroblasts as well as 3 lung tumor cell lines (522- 
23, HTB, and 343T). Transduced lines were selected and expanded to examine L762P 
surface expression by FACS analysis. For this analysis, non-transduced and transduced 

20 cells were harvested using cell dissociation medium, and incubated with 10-50 
micrograms/ml of either affinity purified anti-L762P or irrelevant antisera. Following a 
30 minute incubation on ice, cells were washed and incubated with a secondary, FITC 
conjugated, anti rabbit IgG antibody as above. Cells wore washed, resuspended in buffer 
with Propidium Iodide (PI) and examined by FACS using an Excalibur fluorescence 

25 activated cell sorter. For FACS analysis, Pi-positive (i.e. dead/penneabilized cells) were 
excluded. The polyclonal anti-L762P sera specifically recognized and bound to the 
surface of L762P-transduced cells but not the non-transduced counterparts. These 
results demonstrate that L762P is localized to the cell surface of both fibroblasts as well 
as lung tumor cells. 

30 To identify the peptide epitopes recognized by 2692L, an epitope 

mapping approach was pursued. A series of overlapping 19-21 mers (5 amino acid 
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overlap) was synthesized that spanned the C terminal portion of L762P (amino acids 
481-894 of SEQ ID NO: 161). In an initial experiment peptides were tested in pools. 
Specific reactivity with the L762P antiserum was observed with pools A, B, C, and E. 
To identify the specific peptides recognized by the antiserum, flat bottom 96 well 
5 microtiter plates were coated with individual peptides at 1 0 microgram/ml for 2 hours at 
37 °C. Wells were then aspirated and blocked with phosphate buffered saline containing 
5% (w/v) milk for 2 hours at 37 °C, and subsequently washed in PBS containing 0.1% 
Tween 20 (PBST). Purified rabbit anti-L762P serum 2692L was added at 200 or 20 
ng/well to triplicate wells in PBST and incubated overnight at room temperature. This 
10 was followed by washing 6 times with PBST and subsequently incubating with HRP- 
conjugated donkey anti rabbit IgG (H+L)Affinipure F(ab') fragment at 1:2,000 for 60 
minutes. Plates were then washed, and incubated in tetramethyl benzidine substrate. 
Reactions were stopped by the addition of IN sulfuric acid and plates were read at 
450/570 nm using an ELISA plate reader. 

15 The resulting data, presented in Table 4 below, demonstrates that the 

L762P antisera recognized at least 6 distinct peptide epitopes from the 3' half of L762P. 



148 



WO 02/00174 



PCT/US01/21065 



Table 4 



ELISA activity (OP 450-570 



Peptide (starting 
amino acid of L762P) 


pool 


200 ng 
polyclonal serum 


20 ng 
polyclonal serum 


A (481).* . 


f •> A - f; 


\ : t76.' v 


1.Q ■ 


B (495) 


A 


0.14 


.06 


- i / C(51i) v 




"zzX SMI 


0,18,:: ^ 


D(526) 


£ 


0.11 


0.09 


E (541) 


A 


0.11 


0.04 


F(556) 


A 


0.04 


0.02 


G (571) 


A 


0.06 


0.02 


H(586) v 


B 


0.1 


0.03 


r 1(601) 






a QM -v ^ 


J (616) 


B 


0.1 


0.03 


K(631) 


E 


L 0.1 


0.08 


.? . . t(0#0)^ 






-f / : - <Rtt :- 


M (661) 


B 


0.14 


0.03 


N (676) 


C 


0.12 


0.1 


-;; -j +Qmm ■>.*■ 






; on > ^ 


P(706) 


r c 


0.1 


0.03 


Q(721) 


c 


0.11 


0.05 


R(736) 


E 


0.12 


0.04 


S (751) 


C 


0.15 


0.06 


U(781) 


D 


0.12 


0.06 


V(795) 


F 


0.07 


0.05 


X(826) 


D 


0.1 


0.03 


Y (841) 


D 


0.17 


0.07 


Z(856) 


D 


0.16 


0.08 


AA (871) 


F 


0.17 


0.05 


A* BB(874)- . - r '; 






, y 0J1 .-> r . 


No peptide 




0.15 


0.045 



Individual peptides were identified from each of the pools, and 
5 additionally a weak reactivity was identified with peptide BB from pool F. The relevant 
peptide epitopes are summarized in the Table 5 below The amino acid sequences for 
peptides BB, O, L, I, A and C are provided in SEQ ID NO: 376-381, respectively, with 
the corresponding cDNA sequences being provided in SEQ ID NO: 373, 370, 372, 374, 
371 and 375, respectively. 

10 
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Table 5 



EL ISA activity 
i 01> 450-5570* 


Peptide 


Nucleotides 
ofL762P 


Amino 
acids of 
L762P 


Sequence 


pool 


200 ng 


20 ng 


A 


1441-1500 


481-500 


SRISSGTGDIFQQHIOLEST 


A 


1.76 


1.0 


C 


1531-1590 


511-530 


KNTVTVDNTVGNDTMFLVTW 


E 


0.47 


0.18 


I 


1801-1860 


601-620 


AVPPATVEAFVERDSLHFPH 


B 


0.25 


0.06 


L 


1936-1955 


646-665 


PETGDPVTLRLLDDGAGADV 


B 


0.28 


0.12 


0 


2071-2130 


691-710 


VNHSPSISTPAHSIPGSHAM1L 


C 


1.1 


0.23 


BB 


2620-2679 


874-893 


LQSAVSNIAQAPLFIPPNSD 


F 


0.14 


0.11 


None 










0.15 


0.05 



EXAMPLE 16 

5 DETECTION OF ANTIBODIES AGAINST LUNG TUMOR ANTIGENS 

IN PATIENT SERA 

Antibodies specific for the lung tumor antigens L773PA (SEQ ID 
NO:361), L514S (SEQ ID NO:155 and 156), L523S (SEQ ID NO:176), L762P (SEQ ID 

10 NO:161) and L763P (SEQ ID NO:159) were shown to be present in effusion fluid or 
sera of lung cancer patients but not in normal donors. More specifically, the presence of 
antibodies against L773PA, L514S, L523S, L762P and L763P in effusion fluid obtained 
from lung cancer patients and in sera from normal donors was detected by ELISA using 
recombinant proteins and HRP-conjugated anti-human Ig. Briefly, each protein (1.00 

15 ng) was coated in 96-well plate at pH 9.5. In parallel, BSA (bovine serum albumin) was 
also coated as a control protein. The signals ([S], absorbance measured at 405 nm) 
against BSA ([N]) were determined. The results of these studies are shown in Table 6, 
wherein - represents [S]/[N] < 2; +/- represents [S]/[N] >2; ++ represents [S]/[N] >3; 
and +++ represents [S]/(N] >5. 

20 
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Table 6 - Detection of Antibodies against Lung Tumor Antigens 





T C1/1C 


T COO O 


L7o2r 




T TTIDA 

L,//irA 


Effusion 
tluid 












4/1 
#1 


T TT 


t y 
T r 


TT 




■It. 
TT 


4/0 






-1-/ 


t 1 
TT 


4-/ 
T/- 


#3 










t/- 


#4 


4-/ 
T/- 


t i 
TT 


T/- 




T/- 


J1C 

#5 


i / 


t i i 
TTT 




t/- 


t t 
TT 


44*7 
#7 










» / 

+/- 


#8 




1 1 1 






_t_ i. 
Tt 


#10 




i i 
TT 


+/- 






111 1 

#ll 


+/- 


ff 


TT 




++ 


#12 


4 1 i 






+/- 


+/- 


#13 




+/- 






+/- 


#14 




.111 


+/- 


+/- 


-H- 


lit c 

#15 


+/- 


+T 


+/- 




++ 


4/n 
#1 / 




-4-/ 
T/- 






4-/ 
T/- 


#18 




++ 








#19 




+/- 






+/- 


#20 


+/- 


+/- 


+/- 




+/- 


Normal sera 












#21 




+/- 








#22 












#23 










+/- 


#24 




+A 








#25 


+/- 


+A 


f 




+/- • 



Using Western blot analyses, antibodies against L523S were found to be 
5 present in 3 out of 4 samples of effusion fluid from lung cancer patients, with no L523S 
antibodies being detected in the three samples of normal sera tested. 



EXAMPLE 17 

10 EXPRESSION IN K COJJOV A L514S HIS TAG FUSION PROTEIN 

PGR was performed on the L514S-13160 'coding region with the 
following primers: 
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Forward primer PDM-278 5' cacactagtgtccgcgtggcggcctac 3' (SEQ ID 
NO:421)Tm67°C. 

Reverse primer PDM-280 5' catgagaattcatcacatgcccttgaaggctccc 3' 
(SEQ ID NO:422) TM 66°C. 
5 The PCR conditions were as follows: 

lOjil lOXPfii buffer 
1.0^1 lOmMdNTPs 
2.0^1 lOyM each primer 
83^x1 sterile water 

10 1 .5jil Pfu DNA polymerase (Stratagene, La Jolla, CA) 

SOrjgDNA 

96°C for 2 minutes, 96°C for 20 seconds, 66°C for 15 seconds, 72°C for 
1 minute with 40 cycles and then 72°C for 4 minutes. 

The PCR product was digested with EcoRI restriction enzyme, gel 
15 purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
correct construct was confirmed by DNA sequence analysis and then transformed into 
BL21 CodonPlus (Stratagene, La Jolla, CA) cells for expression. 

The amino acid sequence of expressed recombinant L514S is shown in 
20 SEQ ID NO:423, and the DNA coding region sequence is shown in SEQ ID NO:424. 



EXAMPLE 18 

EXPRESSION IN R CPU OF A L523S HIS TAG FUSION PROTEIN 
25 PCR was performed on the L523S coding region with the following 

primers: 

Forward primer PDM-414 5' aacaaactgtatatcggaaacctcagcgagaa 3' (SEQ 
IDNO:425)Tm62°C. 

Reverse primer PDM-415 5' ccatagaattcattacttccgtcttgactgagg 3' (SEQ 
30 IDNO:426)TM62°C. 
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, The PCR conditions were as follows: 
10|xl 10X Pfu buffer 
1.0^1 lOmM dNTPs 
2.0^1 10|iM each primer 
5 83pl sterile water 

1.5^1 Pfu DNA polymerase (Stratagene, La Jolla, CA) 
SOrigDNA 

96°C for 2 minutes, 96°C for 20 seconds, 62°C for 15 seconds, 72°C for 
4 minutes with 40 cycles and then 72°C for 4 minutes. 

10 The PCR product was digested with EcoRI restriction enzyme, gel 

purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
correct construct was confirmed by DNA sequence analysis and then transformed into 
BL21 CodonPlus (Stratagene, La Jolla, CA) cells for expression. 

15 The amino acid sequence of expressed recombinant L523S is shown in 

SEQ ID NO:427, and the DNA coding region sequence is shown in SEQ ID NO:428. 

EXAMPLE 19 

20 EXPRESSION IN E. CPU OF A L762PA HIS TAG FUSION PROTEIN 

PCR was performed on the L762PA coding region (L762PA is missing 
the signal sequence, the C-terminal transmembrane domain and the cytoplasmic tail) 
with the following primers: 

Forward primer PDM-278 5'ggagtacagcttcaagacaatggg 3' (SEQ ID 
25 NO:355)Tm57°C. 

Reverse primer PDM-279 5'ccatggaattcattatttcaatataagataatctc 3' (SEQ 
IDNO:429)TM56 0 C. 

The PCR conditions were as follows: , 
lOjxl 10X Pfu buffer 
30 LOpllOmM dNTPs 
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2.0^1 10^M each primer 
83jil sterile water 

1 .5|J Pfu DNA polymerase (Stratagene, La Jolla, CA) 
50rig DNA 

5 96°C for 2 minutes, 96°C for 20 seconds, 55°C for 15 seconds, 72°C for 

5 minutes with 40 cycles and then 72°C for 4 minutes. 

The PCR product was digested with EcoRI restriction enzyme, gel 
purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
10 correct construct was confirmed by DNA sequence analysis and then transformed into 
BL21 pLys S (Novagen, Madison, WI) cells for expression. 

The amino acid sequence of expressed recombinant L762PA is shown in 
SEQ ID NO.430, and the DNA coding region sequence is shown in SEQ ID NO:43 1 . 

15 

EXAMPLE 20 

EXPRESSION IN R CPU OF A L773P HIS TAG FUSION PROTEIN 

PCR was performed on the L773P coding region with the following 

primers: 

20 Forward primer PDM-299 5* tggcagcccctcttcttcaagtggc 3' (SEQ ID 

NO:359)Tm63°C. 

Reverse primer PDM-300 5' cgcctgctcgagtcattaatattcatcagaaaatgg 3' 
(SEQ ID NO:432) TM 63°C. 

The PCR conditions were as follows: 
25 10pJ 10X Pfu buffer 

1.0^1 lOmMdNTPs 
2.0^1 IOjiM each primer 
83^1 sterile water 

1 .5^1 Pfu DNA polymerase (Stratagene, La Jolla, CA) 
30 50r)g DNA 
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/ 96°C for 2 minutes, 96°C for 20 seconds, 63 °C for 15 seconds, 72°C for 
2 minutes 1 5 seconds with 40 cycles and then 72°C for 4 minutes. 

The PCR product was digested with EcoRI restriction enzyme, gel 
purified and then cloned into pPDM His, a modified pET28 vector with a His tag in 
5 frame, which had been digested with Eco72I and EcoRI restriction enzymes. The 
correct construct was confirmed by DNA sequence analysis and then transformed into 
BL21 pLys S (Novagen, Madison, WI) and BL21 CodonPlus (Stratagene, La Jolla, CA) 
cells for expression. 

The amino acid sequence of expressed recombinant L773P is shown in 
10 SEQ ID NO:433, and the DNA coding region sequence is shown in SEQ ID NO:434. 

EXAMPLE 21 

CLONING AND SEQUENCING OF A T-CELL RECEPTOR CLONE FOR THE 
15 LUNG SPECIFIC ANTIGEN L762P 

T cell receptor (TCR) alpha and beta chains from a CD4 T cell clone 
specific for the lung specific antigen L762P were cloned and sequence. Basically, total 
mRNA from 2 X 10 6 cells from CTL clone 4H6 was isolated using Trizol reagent and 
cDNA was synthesized using Ready-to go kits (Pharmacia). To determine Valpha and 
20 Vbeta sequences of this clone, a panel of Valpha and Vbeta subtype specific primers 
was synthesized and used in RT-PCR reactions with cDNA generated from each of the 
clones. The RT-PCR reactions demonstrated that each of the clones expressed a 
common Vbeta sequence that corresponded to 1he Vbeta8 subfamily and a Valpha 
sequence that corresponded to the Valpha8 subfamily. To clone the full TCR alpha and 
25 beta chains from clone 4H6, primers were designed that spanned the initiator and 
terminator-coding TCR nucleotides. The primers were as follows: 

forward primer for TCR Valpha8 5' 

ggatccgccgccaccatgacatccattcgagctgta 3' (SEQ ID NQ:435; has a BamHI site 
inserted); 
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/ Kozak reverse primer for TCR Valpha8 (antisense) 5' 
gtcgactcagctggaccacagccgcag 3' (SEQ ID NO:436; has a Sail site inserted plus 

i 

the TCR alpha constant sequence); 

forward primer for TCR Vbeta8 (sense) 5' 
5 ggatccgccgccaccatggactcctggaccttctgct 3' (SEQ ID NO:437; has a BamHI site 

inserted); and 

Kozak reverse primer for TCR Vbeta 5' gtcgactcagaaatcctttctcttgac 3* 
(SEQ ID NO:438; has a Sail site inserted plus the TCR beta constant sequence). 
Standard 35 cycle RT-PCR reactions were established using the cDNA synthesized 

10 from the CTL clone and the above primers utilizing the proofreading thermostable 
polymerase, PWO (Roche). The resultant PCR band, about 850 bp for Valpha and 
about 950 for Vbeta, was ligated into a PCR blunt vector (Invitrogen) and transformed 
into R coli. E coli transformed with plasmids having full-length alpha and beta chains 
were identified.. Large scale preparations of the corresponding plasmids were 

15 generated, and these plasmids were sequenced. The Valpha sequence (SEQ ID 
NO:439) was shown by nucleotide sequence alignment to be homologous to Valpha8.1, 
while the Vbeta sequence (SEQ ID NO:440) was shown by nucleotide sequence 
alignment to be homologous to Vbeta82. 

20 

EXAMPLE 22 

RECOMBINANT EXPRESSION OF FULL LENGTH L762P IN MAMMALAIN 

CELLS 

Full length L762P cDNA was subcloned into the mammalian expression 
25 vectors VR1012 and pCEP4 (Invitrogen). Both expression vectors had previously been 
modified to contain a FLAG epitope tag. These constructs were transfected into 
HEK293 and CHL-1 cells (ATCC) using Lipofectamine 2000 reagent (Gibco). Briefly, 
both the HEK and CHL-1 cells were plated at a density of 100,000 cells/ml in DMEM 
(Gibco) containing 10% FBS (Hyclone) and grown overnight. The following day, 4ui 
30 of Lipofectamine 2000 was added to 1 OOul of DMEM containing no FBS and incubated 
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for 5 miijutes at room temperature. The Lipofectamine/DMEM mixture was then added 
to Ipg of L762P Flag/pCEP4 or L762P Flag/VR1012 plasmid DNA resuspended in 
lOOjxl DMEM and incubated for 15 minutes at room temperature. The 
Lipofectamine/DNA mix was then added to the HEK293 and CHL-1 cells and 
5 incubated for 48-72 hours at 37°C with 7% C0 2 . Cells were rinsed with PBS, then 
collected and pelleted by centrifugation. L672P expression was detected in the 
transfected HEK293 and CHL-1 cell lysates by Western blot analysis and was detected 
on the surface of transfected HEK cells by flow cytometry analysis. 

For Western blot analysis, whole cell lysates were generated by 

10 incubating the cells in Triton-XlOO containing lysis buffer for 30 minutes on ice. 
Lysates were then cleared by centrifugation at 10,000 rpm for 5 minutes at 4°C. 
Samples were diluted with SDS-PAGE loading buffer containing beta-mercaptoethanol, 
then boiled for 10 minutes prior to loading the SDS-PAGE gel. The protein was 
transferred to nitrocellulose and probed using 1 fig/ml purified anti-L762P rabbit 

15 polyclonal sera (lot #690/73) or non-diluted anti-L762P mAb 153.20.1 supernatant. 
Blots were revealed using either goat anti-rabbit Ig coupled to HRP or goat anti-mouse 
Ig coupled to HRP followed by incubation in ECL substrate. 

For flow cytometric analysis, cells were washed further with ice cold 
staining buffer (PBS+1%BSA +Azide). Next, the cells were incubated for 30 minutes 

20 on ice with lOug/ml of purified anti-L762P polyclonal sera (lot #690/73) or a 1:2 
dilution of anti-L762P mAb 153.20.1 supernatant. The cells were washed 3 times with 
staining buffer and then incubated with a 1:100 dilution of goat anti-rabbit Ig(H+L)- 
FITC or goat anti-mouse Ig(H+L)-FITC reagent (Southern Biotechnology) for 30 
minutes on ice. After 3 washes, the cells were resuspended in staining buffer containing 

25 propidium iodide (PI), a vital stain that allows for the exclusion of permeable cells, and 
analyzed by flow cytometry. 
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EXAMPLE 23 

GENERATION OF POLYCLONAL ANTIBODIES TO LUNG TUMOR ANTIGENS 
Three lung antigens, L523S (SEQ ID NO:176), L763P (SEQ ID NO:159) 
and L763 peptide #2684 (SEQ ID NO:44l), were expressed and purified for use in 
5 antibody generation. 

L523S and L763P were expressed in an E. coli recombinant expression 
system and grown overnight in LB Broth with the appropriate antibiotics at 37°C in a 
shaking incubator. The next morning, 10 ml of the overnight culture was added to 500 
ml of 2x YT with the appropriate antibiotics in a 2L-baffled Erienmeyer flask. When 
10 the optical density of the culture reached 0.4-0.6 at 560 nanometers, the cells were 
induced with IPTG (1 mM). Four hours after induction with IPTG, the cells were 
harvested by centrifugation. 

The cells were then washed with phosphate buffered saline and 
centrifuged again. The supernatant was discarded and the cells were either frozen for 
1 5 future use or immediately processed. Twenty milliliters of lysis buffer was added to the 
cell pellets and vortexed. To break open the E. coli cells, this mixture was then run 
through a french press at a pressure of 16,000 psi. The cells were then centrifuged again 
and the supernatant and pellet were checked by SDS-PAGE for the partitioning of the 
recombinant protein. 

20 For proteins that localized to the cell pellet, the pellet was resuspended in 

10 mM Tris pH 8.0, 1% CHAPS and the inclusion body pellet was washed and 
centrifuged again. This procedure was repeated twice more. The washed inclusion body 
pellet was solubilized with either 8M urea or 6M guanidine HC1 containing 10 mM Tris 
pH 8.0 plus 10 mM imidazole. The solubilized protein was added to 5 ml of nickel- 

25 chelate resin (Qiagen) and incubated for 45 minutes to 1 how at room temperature with 
continuous agitation. 

After incubation, the resin and protein mixture was poured through a 
disposable column and the flow through was collected. The 9olumn was then washed 
with 10-20 column volumes of the solubilization buffer. The antigen was then eluted 

30 from the column using 8M urea, 10 mM Tris pH 8.0 and 300 mM imidazole and 
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collected^ 3 ml fractions. A SDS-PAGE gel was run to determine which fractions to 
pool for further purification. 

As a final purification step, a strong anion exchange resin, in this case 
Hi-Prep Q (Biorad), was equilibrated with the appropriate buffer and the pooled 
5 fractions from above were loaded onto the column. Each antigen was eluted off the 
column with an increasing salt gradient. Fractions were collected as the column was 
run and another SDS-PAGE gel was run to determine which fractions from the column 
to pool. 

The pooled fractions were dialyzed against 10 mM Tris pH 8.0. The 

10 release criteria were purity as determined by SDS-PAGE or HPLC, concentration as 
determined by Lowry assay or Amino Acid Analysis, identity as determined by amino 
terminal protein sequence, and endotoxin level was determined by the Limulus (LAL) 
assay. The proteins were then put in vials after filtration through a 0.22-micron filter 
and the antigens were frozen until needed for immunization. 

15 The L763 peptide #2684 was synthesized and conjugated to KLH and 

froze until needed for immunization. 

The polyclonal antisera were generated using 400 micrograms of each 
lung antigen combined with 100 micrograms of muramyldipeptide (MDP). An equal 
volume of Incomplete Freund's Adjuvant (IFA) was added and then mixed and injected 

20 subcutaneously (S.C.) into a rabbit. After foiir weeks, the rabbit was S.C. boosted wifli 
200 micrograms of antigen mixed with an equal volume of IFA. Thereafter the rabbit 
was I.V. boosted with 100 micrograms of antigen. The animal was bled seven days 
following each boost. The blood was then incubated at 4°C for 12-24 hours followed 
by centrifugation to generate the sera. 

25 The polyclonal antisera were characterized using 96 well plates coated 

with antigen and incubated with 50 microliters (typically 1 microgram/microliter) of the 
polyclonal antisera at 4°C for 20 hours. Basically, 250 microliters of BSA blocking 
buffer was added to the wells and incubated at room temperature for 2 hours. Plates 
were washed 6 times with PBS/0.1% Tween. The rabbit sera were diluted in PBS/0.1% 

30 Tween/0. 1 %BSA. 50 microliters of diluted sera was added to each well and incubated 
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at room temperature for 30 minutes. The plates were washed as described above, and 
then 50 microliters of goat anti-rabbit horseradish peroxidase (HRP) at a 1:10000 
dilution was added and incubated at room temperature for 30 minutes. 

The plates were washed as described above, and 100 microliters of TMB 
5 Miciowell Peroxidase Substrate was added to each welL Following a 15-minute 
incubation in the dark at room temperature, the colorimetric reaction was stopped with 
100 microliters of IN H 2 S0 4 and read immediately at 450 nm. All the polyclonal 
antibodies showed immunoreactivity to the appropriate antigen.Tables 7-9 show the 
antibody reactivity of rabbit antisera in serial dilution to the three lung antigens, L523S, 

10 L763P and L763 peptide #2684. The first column shows the antibody dilutions. The 
columns "Pre-immune sera'' indicate ELISA data for two experiments using pre- 
immune sera. These results are averaged in the fourth column. The columns "anti- 
L523S, L763P or #2684" indicate ELISA data for two experiments using sera from 
rabbits immunized as described in this Example, using the respective antigen, referred 

15 to as either L523S, L763P or #2684 in the tables. 
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Table 7 



Antibody 
dilution 


x re- 
lnijxiune 
ser«t \x) 


Jrre- 
i Dim une 

c? awn i / | 

sera \ 


Average 


Anti- 


Anti- 

T <»'>'IC 

tv\ 
W 


Average 


1: 


1000 


0.14 


0.14 


0.14 


2.36 


2.37 


2.37 


1: 


2000 


0.12 


0.10 


0.11 


2.29 


2.23 


2.26 


1: 


4000 


0.10 


0.09 


0.10 


2.11 


2.17 


2.14 


1: 


8000 


0.09 


0.09 


0.09 


1.98 


2.00 


1.99 


1: 


16000 


0.09 


0.09 


0.09 


1.73 


1.76 


1.75 


1: 


.32000 


0.09 


0.09 


0.09 


1.35 


1.40 


1.37 


1: 


64000 


0.09 


0.11 


0.10 


0.94 


0.98 


0.96 


1: 


128000 


0.09 


0.08 


0.08 


0.61 


0.61 


0.61 


1: 


.256000 


0.08 


0.08 


0.08 


0.38 


0.38 


0.38 


1. 


512000 


0.09 


0.08 


0.08 


0.24 


0.25 


0.25 


1: 


1024000 


0.08 


0.08 


0.08 


0.17 


0.17 


0.17 


1: 


2048000 


0.08 


0.08 


0.08 


0.14 


0.13 


0.13 



Table 8 

5 



Antibody 


Pre- 


Pre- 


Average 


Anti- 


Anti- 


Average 


dilution 


immune 


immune 




L763P 


L763P 








sera (1) 


sera (2) 




(1) 


(2) 




1 


1000 


0.09 


0.11 


0.10 


1.97 


1.90 


1.93 


1: 


2000 


0.07 


0.07 


0.07 


1.86 


1.84 


1.85 


1; 


4000 


0.06 


0.06 


0.06 


1.82 


1.81 


1.81 


1: 


:8000 


0.06 


0.06 


0.06 


1.83 


1.81 


1.82 


1: 


16000 


0.06 


0.05 


0.06 


1.79 


1.74 


1.76 


1: 


32000 


0.06 


0.06 


0.06 


1.56 


1.51 


1.53 


1: 


64000 


0.06 


0.05 


0.05 


1.35 


1.34 


1.35 


1: 


128000 


0.05* 


0.05 


0.05 


1.01 


0.98 


0.99 


1: 


256000 


0.06 


0.05 


0.05 


0.69 


0.70 


0.70 


1: 


.512000 


0.06 


0.05 


0.05 


0.47 


0.44 


0.46 


1, 


1024000 


0.06 


0.05 


0.06 


0.27 


0.27 


0.27 


1: 


:2048000 


0.05 


0.05 


0.05 


0.16 


0.15 


0.16 
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Table 9 



Antibody 


Pre- 


Pre- 


Average 


Anti- 


Anti- 


Average 


rlilii 4~inri 
UllUllUll 


JLUUfil UI1C 


immune 




#2684 


#2684 




sera (1) 


sera (2) 




a) 


(2) 




1:1000 


0.07 


0.07 


0.07 


2.10 


2.00 


2.05 


1:2000 


0.07 


0.06 


0.06 


1.95 


1.96 


1.95 


1:4000 


0.06 


0.06 


0.06 


1.77 


1.82 


1.79 


1:8000 


0.06 


0.06 


0.06 


1.79 


1.81 


1.80 


1:16000 


0.06 


0.06 


0.06 


1.54 


1.50 


1.52 


1:32000 


0.06 


0.06 


0.06 


1.27 


1.20 


1.24 


1:64000 


0.06 


0.06 


0.06 


0.85 


0.82 


0.83 


0 


0.06 


0.06 


0.06 


0.06 


0.06 


0.06 



5 Tables 10-12 show the affinity purification of the respective antibodies 

to the three lung antigens, L523S, L763P and L763 peptide #2684. 



Table 10 



Antibody 


Affinity 


Affinity 


Average 


Affinity 


Affinity 


Average 


cone. 


pure 


pure 




pure 


pure 




(ug/ml) 


(salt 


(salt 




(acid 


(acid 






peak) 


peak) 




peak) 


peak) 




1.0 


2.38 


2.35 


2.36 


2.25 


2.31 


228 


0.5 


2.24 


2.22 


2.23 


2.19 


2.18 


2.18 


0.25 


2.05 


2.09 


2:07 


2.01 


2.03 


2.02 


0.13 


1.70 


1.81 


1.75 


1.74 


1.74 


1.74 


0.063 


1.44 


1.44 


1.44 


1.43 


1.38 


1.40 


0.031 


1.05 


1.05 


1.05 


0.99 


0.99 


0.99 


0.016 


0.68 


0.67 


0.68 


0.65 


0.64 


0.64 


0.0078 


0.43 


0.42 


0.42 


0.39 


0.39 


0.39 


0.0039 


027 


0.26 


0.27 


024 


0.26 


0.25 


0.0020 


0.18 


0.20 


0.19 


0.19 


0.18 


0.19 


0.0010 


0.13 


0.14 


0.13 


0.13 


0.14 


0.13 


0.00 


0.11 


0.12 


0.11 


0.10 


0.12 


0.11 



1 
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Table 11 



Antibody 
dilution 


Affinity 
pure 


Affinity 
pure 


Average 


1 


.1000 


1.64 


1.77 


1.70 


1 


2000 


1.59 


1.76 


1.68 


1 


:4000 


1.48 


1.62 


1.55 


1 


:8000 


1.35 


1.43 


1.39 


1 


:16000 


1.09 


1.19 


1.14 


1 


32000 


0.81 


0.89 


0.85 


1 


:64000 


0.55 


0.58 


0.56 


1. 


128000 


0.31 


0.35 


0.33 


1 


:256000 


0.18 


0.20 


0.19 


1 


:512000 


0.11 


0.12 


0.11 


1 


:1024000 


0.07 


0.07 


0.07 


1 


:2048000 


0.06 


0.06 


0.06 | 



Table 12 

5 



Antibody 


Affinity 


Affinity 


Average 


cone, 
(ug/ml) 


pure 


pure 




1.0 


2.00 


, 2.02 


2.01 


0.5 


2.01 


1.93 


1.97 


025 


1.84 


1.83 


1.84 


0.13 


1.80 


1.83 


1.81 


0.06 


1.39 


1.60 


1.50 


0.03 


1.33 


1.35 


1.34 


0.02 


0.94 


0.93 


0.94 


0.00 


0.06 


0.06 


0.06 



EXAMPLE 24 
FULL-LENGTH cDNA SEQUENCE ENCODING L529S 
10 The isolation of a partial sequence (SEQ ID NO:106) for lung antigen 

L529S was previously provided in Example 2. This partial sequence was used as a 
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query tojdentify potential full length cDNA and protein sequences by searching against 
publicly available databases. The predicted full-length cDNA sequence for the isolated 
cloned sequence of SEQ ID NO: 106 is provided in SEQ ID NO:442. The deduced 
amino acid sequence of the antigen encoded by SEQ ID NO:442 is provided in SEQ ID 
5 NO:443. It was previously disclosed in Example 2 that L529S shows similarity to 
connexin 26, a gap junction protein. 

EXAMPLE 25 

EXPRESSION IN MEGATERIUM OF A HISTIDINE TAG-FREE L523S FUSION 
10 PROTEIN 

PCR was performed on the L523S coding region with the following 

primers: 

Forward prima* PDM-734 5' caatcaggcatgcacaacaaactgtatatcggaaac 3' 
1 5 (SEQ ID NO:444) Tm 63°C. 

Reverse primer PDM-735 5* cgtcaagatcttcattacttccgtcttgac 3' (SEQ ID 
NO:445)TM60°C. 

The PCR conditions were as follows: 
lOpl lOXPft buffer 
20 LOjollOmMdNTPs ' 

2.0jil IOjjM each primer 
83 (ol sterile water 

1 .5|J Pfu DNA polymerase (Stratagene, La Jolla, CA) 
50r|g DNA 

25 96°C for 2 minutes, 96°C for 20 seconds, 62°C for 1 5 seconds, 72°C for 

4 minute with 40 cycles and then 72°C for 4 minutes. 

The PCR product was digested with SphI and Bglll restriction enzymes, 
gel purified and then cloned into pMEG-3, which had been digested with SphI and Bglll 
restriction enzymes. The correct construct was confirmed by DNA sequence analysis 

30 and then transformed into Megaterium cells for expression. 
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The amino acid sequence of expressed recombinant L523S is shown in 
SEQ ID NO:446, and the DNA coding region sequence is shown in SEQ ID NO:447. 

5 EXAMPLE 26 

EXPRESSION IN E. COLI OF A HISTTDINE TAG-FREE L523S FUSION PROTEIN 

PCR was performed on the L552S coding region with the following 

primers: 

10 Forward primer PDM-733 5' cgtactagcatatgaacaaactgtatatcggaaac 3' 

(SEQ ID NO:448) Tm 64°C. 

Reverse primer PDM-415 5' ccatagaattcattacttccgtcttgactgagg 3* (SEQ 
IDNO:426)TM62°C. 

The PCR conditions were as follows: 
15 lOjillOXPfu buffer 

l.OfillOmMdNTPs 
2-Op.l IOjoM each primer 
83pl sterile water 

1 ,5|xl Pfu DNA polymerase (Stratagene, La Jolla, CA) 
20 50t|g DNA 

96°C for 2 minutes, 96°C for 20 seconds, 62°C for 15 seconds, 72°C for 
4 minute with 40 cycles and then 72°C for 4 minutes. 

The PCR product was digested with Ndel and EcoRI restriction 
enzymes, gel purified and then cloned into pPDM, a modified pET28 vector, which had 
25 been digested with Ndel and EcoRI restriction enzymes. The correct construct was 
confirmed by DNA sequence analysis and then transformed into BLR pLys S and HMS 
174 pLys S cells for expression. 

The amino acid sequence of expressed recombinant L523S is shown in 
SEQ ID NO:449, and the DNA coding region sequence is shown in SEQ ID NO:450. 
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EXAMPLE 27 

EPITOPE-ANALYSIS OF L514S AND L523S-SPECIFIC ANTIBODIES 

5 Peptides of candidate antigens can be used for the evaluation of antibody 

responses in both preclinical and clinical studies. These data allow one to further 
confirm the antibody response against a certain candidate antigen. Protein-based ELISA 
with and without competitive peptides and peptide-based ELISA can be used to 
evaluate these antibody responses. Peptide ELISA is especially useful since it can 

10 further exclude the false positive of the antibody titer observed in protein-based ELISA 
as well as to provide the simplest assay system to test antibody responses to candidate 
antigens. In this example, data was obtained using both L514S- and L523S-peptides 
that show that individual cancer patients produce L514S- and L523S-specific 
antibodies. The L514S-specific antibodies-recognize primarily the following epitope of 

15 L514S: 

aa86-110: LGKEVRDAKITPEAFEKLGFPAAKE (SED ID 

NO:451). 

This epitope is the common epitope in humans. A rabbit antibody 
specific for L5 14S recognizes two addition epitopes of L514S: 
20 (1) aa21-45: KASDGDVYTLAVPMGDVPMDGISVA (SEQ ID 

NO:452) 

(2) aal21-135: PDRDVNLTHQLNPKVK (SED ID NO:453) 
It was further found that the SEQ ID NO:452 is common to both L514S 
isoforms, L514S-13160 and L514S-13166, whereas the other epitopes, SEQ ID NO:451 
25 and SEQ ID NO:453, are probably specific to the isoform, L514S-13 160. 

The L523S-specific antibodies recognize primarily the following epitope 

ofL523S: 

aa440-460: KIAPAEAPDAKVRMVIITGP , (SEQ ID NO:454). 
This epitope is the common epitope in humans. A rabbit antibody 
30 specific for L523S recognizes two other epitopes: 



166 



WO 02/00174 



PCT/USOl/21065 



, (1) aal56-175 PDGAAQQNNNPLQQPRG (SEQ ID 

NO:455) 

(2) aa326-345: RTITVKGNVETCAKAEEEIM (SED ED 
NO:456) 

5 

In further studies, it was determined by peptide based ELISAs that eight 
additional epitopes of L523S were recognized by L523S-specific antibodies: 

(l)aa40-59 AFVDCPDESWALKADBALS (SEQ ID 

NO:457) 

10 (2)aa80-99: ERKLQBRMPPHLQWEVLDS (SED ID 

NO:458) 

(3) aal60-179: AQQNPLQQPRGRRGLGQRGS (SEQ ID 
NO:459) 

(4) aal80-199: DVHRKENAGAAEKSITDLST (SED ID 
15 NO:460) 

(5) aa320-339: LYNPERTITVKGNVETCAKA (SEQ ID 
NO:461) 

(6) aa340-359: EEEIMKKIRESYENDIASMN (SED ID 
NO:462) 

20 (7) aa370-389: LNALGLFPPTSGMPPPTSGP (SEQ ED 

NO:463) 

(8) aa380-399: KIAPAEAPDAKVRMVHTGP (SED ID 

NO:464) 

Out of these, six epitopes are common in both lung plural effusion fluid 
25 samples and in sera of lung patients. Of these six, SEQ ID NO:459 and SEQ ED 
NO:463 have no homology to other L523S-family proteins such as IGF-II mRNA- 
binding proteins 1 and 2. Accordingly, this indicates that these two peptides can be 
used as an assay system to determine the antibody response to L523S. 



30 
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, EXAMPLE 28 

Generation of L523S-Specific CTL Lines Using In Vitro Whole-Gene Priming 

To determine if L523S is capable of generating a CD8 + T cell immune 
5 response, CTLs were generated using in vitro whole-gene priming methodologies with 
tumor antigen-vaccinia infected DC (Yee et al, The Journal of Immunology, 
157(9):4079-86, 1996), human CTL lines were derived that specifically recognize 
autologous fibroblasts transduced with the L552S tumor antigen, as determined by 
interferon-gamma ELISPOT analysis. Specifically, dendritic cells (DC) were 

10 differentiated from Percoll-purified monocytes derived from PBMC of normal human 
donors by plastic adherence and growing for five days in RPMI medium containing 
10% human serum, 50 ng/ml human GM-CSF and 30 ng/ml human IL-4. Following the 
five days of culture, the DC were infected overnight with a recombinant adenovirus that 
expresses L523S at a multiplicity of infection (M.O.I) of 33, 66 and 100, and matured 

15 overnight by the addition of 2 ^ig/ml CD40 ligand. The virus was then inactivated by 
UV irradiation. In order to generate a CTL line, autologous PBMC were isolated and 
CD8+ T cells were enriched for by the negative selection using magnetic beads 
conjugated to CD4+, CD14+, CD16+, CD19+, CD34+ and CD56f cells. CD8+ T cells 
specific for L523S were established in round bottom 96-well plates using 10,000 L523S 

20 expressing DCs and 100,000 CD8+ T cells per well in RPMI supplemented with 10% 
human serum, lOng/ml of BL-6 and 5ng/ml of IL-12. The cultures were restimulated 
every 7-10 days using autologous primary fibroblasts retrovirally transduced with 
L523S, and the costimulatory molecule CD80 in the presence of IL-2. The cells were 
also stimulated with IFN-gamma to upregulate MHC Class I. The media was 

25 supplemented with 1 OU/ml of EL-2 at the time of stimulation as well as on days 2 and 5 
following stimulation. Following three stimulation cycles, ten L523S specific CD8+ T 
cell lines were identified using interferon-gamma ELISPOT analysis that specifically 
produce interferon-gamma when stimulated with the L523S tumor antigen-transduced 
autologous fibroblasts, but not with a control antigen. 
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/ One line, 6B1, was cloned using anti-CD3 and feeder cells. The clones 
were tested for specificity on L523S-transduced fibroblasts. In addition, using a panel 
of HLA-mismalched lines transduced with a vector expressing L523S and measuring 
interferon-gamma production by this CTL line in an ELISPOT assay, it was determined 
5 that this clone 6B1.4B8 is restricted by HLA-A020L ' 

Also using transfected Cos cells, it was shown that clone 6B1.4B8 
recognizes Cos cells transfected with pcDNA3 HLA A0201/L523S in an HLA- 
restricted and antigen specific manner. 

An epitope mapping study demonstrated the clone 6B1.4B8 recognizes 
1 0 HLA-A201 LCL loaded with peptide pool 3 (a polypeptide corresponding to amino acid 
positions 33-59 of L523S. 

A peptide pool breakdown study demonstrated that clone 6B1.4B8 
recognizes autologous B-LCL loaded with 15-mer peptides from amino acid positions 
37-55 of L523S, TGYAFVCPDESWALKAIE (SEQ ID NO:465). A further peptide 
15 breakdown study demonstrated that clone 6B1.4B8 recognizes T2 cells loaded with the 
same 15-mer peptides. 

A peptide recognition study demonstrated that clone 6B1.4B8 prefers T2 
cells loaded with the peptide FVDCPESWAL (SEQ ID NO:466) which is corresponds 
to the amino acid sequence at positions 41-51 of L523S and is encoded by the DNA 
20 sequence of SEQ ID NO:467. 

EXAMPLE 29 
L523S EXPRESSION IN OTHER HUMAN CANCERS 

25 It was previously disclosed in Example 2 that L523S is expressed in lung 

cancers including squamous, adenocarcinoma and small cell carcinoma. EST profiling 
analyisis of L523S further indicates that this protein may also be expressed in a number 
other tumor types, inclduing colon adenocarcinomas, prostate adenocarcinomas, CML, 
AML, Burkitt's Lymphoma, brain tumors, retinoblastomas, ovarian tumors, 
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teratocarcinomas, uterus myosarcomas, germ cell tumors as well as pancreatic and 
cervical tumor cell lines. 

From the foregoing it will be appreciated that, although specific 
embodiments of the invention have been described herein for purposes of illustration, 
various modifications may be made without deviating from the spirit and scope of the 
invention. Accordingly, the invention is not limited except as by the appended claims. 
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CLAIMS 

1. An isolated polynucleotide comprising a sequence selected from 
the group consisting of: 

(a) sequences provided in SEQ ID NOs:351, 353, 358, 362, 364, 
366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467; 

(b) complements of the sequences provided in SEQ ID NOs:351, 
353, 358, 362, 364, 366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467; 

(c) sequences consisting of at least 10 contiguous residues of a 
sequence provided in SEQ ID NOs:351, 353, 358, 362, 364, 366, 368, 370-375, 420, 
424, 428, 431, 434, 442, 447, 450 and 467; 

(d) sequences that hybridize to a sequence provided in SEQ ID 
NOs:351, 353, 358, 362, 364, 366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 
450 and 467, under highly stringent conditions; 

(e) sequences having at least 75% identity to a sequence of SEQ ID 
NOs:351, 353, 358, 362, 364, 366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 
450 and 467; 

(f) sequences having at least 90% identity to a sequence of SEQ ID 
NOs:351, 353, 358, 362, 364, 366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 
450 and 467; and 

(g) degenerate variants of a sequence provided in SEQ ID NOs:351, 
353, 358, 362, 364, 366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 450 and 467. 



2. An isolated polypeptide comprising an amino acid sequence 
selected from the group consisting of: 

(a) sequences having at least 90% identity to a polypeptide having an 
amino acid sequence of any one of the sequences provided in SEQ ID NOs:352, 354, 
357, 361, 363, 365, 367, 369, 376-382, 387-419, 423, 427, 430, 433, 441, 443, 446, 449 
and 451-466; 
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/ (b) sequences encoded by a polynucleotide of claim 1 ; 

(c) sequences having at least 70% identity to a sequence encoded by 
a polynucleotide of claim 1 ; and 

(d) sequences having at least 90% identity to a sequence encoded by 
a polynucleotide of claim 1. 

3. An expression vector comprising a polynucleotide of claim 1 
operably linked to' an expression control sequence. 

4. A host cell transformed or transfected with an expression vector 
according to claim 3. 

5. An isolated antibody, or antigen-binding fragment thereof, that 
specifically binds to a polypeptide of claim 2. 

6. A method for detecting the presence of a cancer in a patient, 
comprising the steps of: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample with a binding agent that binds 
to a polypeptide of claim 2; ' 

(c) detecting in the sample an amount of polypeptide that binds to 
the binding agent; and 

(d) comparing the amount of polypeptide to a predetermined cut-off 
value and therefrom determining the presence of a cancer in the patient. 

7. A fusion protein comprising at least one polypeptide according to 

claim 2. 
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/ 8. A fusion protein according to claim 9, wherein the fusion protein 
is selected from the group consisting sequences provided in SEQ ID NOs:352, 354, 423, 
427,430 and 433. 

9. An oligonucleotide that hybridizes to a sequence recited in SEQ 
ID NOs:351, 353, 358, 362, 364, 366, 368, 370-375, 420, 424, 428, 431, 434, 442, 447, 
450 and 467 under highly stringent conditions. 

10. A method for stimulating and/or expanding T cells specific for a 
tumor protein, comprising contacting T cells with at least one component selected from 
the group consisting of: 

(a) polypeptides according to claim 2; 

(b) polynucleotides according to claim 1; and 

(c) antigenrpresenting cells that express a polynucleotide according 

to claim 1, 

under conditions and for a time sufficient to pennit the stimulation 
and/or expansion of T cells. 

11. An isolated T cell population, comprising T cells prepared 
according to the method of claim 10. 

12- A composition comprising a first component selected from the 
group consisting of physiologically acceptable carriers and immunostimulants, and a 
second component selected from the group consisting of: 

(a) polypeptides according to claim 2; 

(b) polynucleotides according to claim 1 ; 

(c) antibodies according to claim 5; 

(d) fusion proteins according to claim 7; , 

(e) T cell populations according to claim 1 1; and 
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/ (f) antigen presenting cells that express a polypeptide according to 

claim 2. 

13. A method for stimulating an immune response in a patient, 
comprising administering to the patient a composition of claim 12. 

14. A method for the treatment of a lung cancer in a patient, 
comprising administering to the patient a composition of claim 12. 

15. A method for determining the presence of a cancer in a patient, 
comprising the steps of: 

(a) obtaining a biological sample from the patient; 

(b) contacting the biological sample with an oligonucleotide 
according to claim 9; 

(c) detecting in the sample an amount of a polynucleotide that 
hybridizes to the oligonucleotide; and 

(d) compare the amount of polynucleotide that hybridizes to the 
oligonucleotide to a predetermined cut-off value, and therefrom determining the 
presence of the cancer in the patient. 

16. A diagnostic kit comprising at least one oligonucleotide 
according to claim 9. 

17. A diagnostic kit comprising at least one antibody according to 
claim 5 and a detection reagent, wherein the detection reagent comprises a reporter 
group. 
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/ 18. A method for the treatment of lung cancer in a patient, 
comprising the steps of: 

(a) incubating CD4+ and/or CD8+ T cells isolated from a patient 
with at least one component selected from the group consisting of: (i) polypeptides 
according to claim 2; (ii) polynucleotides according to claim 1; and (iii) antigen 
presenting cells that express a polypeptide of claim 2, such that T cell proliferate; 

(b) administering to the patient an effective amount of the 
proliferated T cells, 

and thereby inhibiting the development of a cancer in the patient. 
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SEQUENCE LISTING 

<110> Corixa Corporation 
Wang, Tongtong 
Marnera kis , Margarita 
Fanger, Gary R. 
Vedvick, Thomas S. 
Carter, Darrick 
Watanabe, Yoshihiro 
Henderson, Robert A. 
Peckham, David W. 
Fanger, Neil 



<120> COMPOSITIONS AND METHODS FOR THE THERAPY 
AND DIAGNOSIS OF LONG CANCER 



<130> 210121. 45502PC 

<140> PCT 

<141> 2001-06-28 



<160> 467 



<170> FastSEQ for Windows Version 4.0 

<210> 1 

<211> 315 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<222> 236, 241 
<223> n = A, T,C or G 



<400> 1 

gcagagacag actggtggtt gaacctggag gtgccaaaaa agccagctgc gggcccagga 60 
cagctgccgt gagactcccg atgtcacagg cagtctgtgt ggttacagcg cccctcagtg 120 
ttcatctcca gcagagacaa cggaggaggc tcccaccagg acggttctca ttatttatat 180 
gttaatatgt ttgtaaactc atgtacagtt ttttttgggg gggaagcaat gggaanggta 240 
naaattacaa atagaatcat ttgctgtaat ccttaaatgg caaacggtca ggccacgtga 300 
aaaaaaaaaa aaaaa 3 15 

<210> 2 
<211> 380 
<212> DNA 

<213> Homo sapiens 

i 

<400> 2 

atttaggctt aagattttgt ttacccttgt tactaaggag caaattagta ttaaagtata 60 
atatatataa acaaatacaa aaagttttga gtggttcagc ttttttattt tttttaatgg 120 
cataactttt aacaacactg ctctgtaatg ggttgaactg tggtactcag actgagataa 180 
ctgaaatgag tggatgtata gtgttattgc ataattatcc cactatgaag caaagggact 240 
ggataaattc ccagtctaga ttattagcct ttgttaacca tcaagcacct agaagaagaa 300 
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ttattggaaa ttttgtcctc tgtaactggc actttggggt gtgacttatc ttttgccttt 360 
gtaaaaaaaa aaaaaaaaaa 380 

'\ 

<210> 3 

<211> 346 

<212> DNA 

<213> Homo sapiens 

<220> 

t <221> misc_feature 

<222> 316, 317, 318, 322, 323, 326, 329, 330, 331, 336, 337, 339, 

340, 342, 343 

<223> n = A,T,C or G 

<400> 3 

ttgtaagtat acaattttag aaaggattaa atgttattga tcattttact gaatactgca 60 

catcctcacc atacaccatc cactttccaa taacatttaa tcctttctaa aattgtaagt 120 

atacaattgt actttctttg gattttcata acaaatatac catagactgt taattttatt 180 

gaagtttcct taatggaatg agtcattttt gtcttgtgct tttgaggtta cctttgcttt 240 

gacttccaac aatttgatca tatagtgttg agctgtggaa atctttaagt ttattctata 300 

gcaataattt ctattnnnag annccnggnn naaaannann annaaa 346 

<210> 4 

<211> 372 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 
<222> 297, 306, 332 
<223> n = A, T, C or G 

<400> 4 

actagtctca ttactccaga attatgctct 
tggtttggtt tggttttttg aactggtatg 
tctcttctcc aagttgtgct ttgtggggac 
agttcaagct gttgaaaaga ctattgctta 
tgtggacagt gcacgtgcct tacgctacat 
aaggantggg tgctttgtga tggataaaac 
aaaacaaaac aa 

<210> 5 

<211> 698 

<212> DNA 

<213> Homo sapiens 



tgtacctgtg tggctgggtt tcttagtcgt 60 
tagggtggtt cacagttcta atgtaagcac 120 
aatcattctl: tgaacattag agaggaaggc 180 
tttttgtttt taaagaccta cttgacgtca 240 
cttgttttct aggaagaagg ggatgcnggg 300 
gnctaaataa cacaccttta cattttgaaa 360 

372 



<220> 

<221> misc_feature 

<222> 8, 345, 422, 430, 433, 436, 438, 472, 481, 486, 515, 521, 
536, 549, 553, 556, 557, 559, 568, 593, 597, 605, 611, 613, 
616, 618, 620, 628, 630, 632, 634, 635, 639, 643, 647, 648, 
649, 652, 654, 658, 664, 690 
<223> n =» A, T,C or G 



<400> 5 

actagtanga tagaaacact gtgtcccgag 
cctaacccag gttaactgca agaagaggcg 
gcataaagcc aatgtagtcc agtttctaag 



agtaaggaga gaagctacta ttgattagag 60 
ggatactttc agctttccat gtaactgtat 120 
atcatgttcc aagctaactg aatcccactt 180 
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caatacacac tcatgaactc ctgatggaac "aataacaggc ccaagcctgt ggtatgatgt 240 
gcacacttgc tagactcaga aaaaatacta ctctcataaa tgggtgggag tattttgggt 300 
^gacaacctac tttgcttggc tgagtgaagg aatgatattc atatnttcat ttattccatg 360 
gacatttagt tagtgctttt tatataccag gcatgatgct gagtgacact cttgtgtata 420 
tntccaaatn ttngtncngt cgctgcacat atctgaaatc ctatattaag antttcccaa 480 
natgangtcc ctggtttttc cacgccactt gatcngtcaa ngatctcacc tctgtntgtc 540 
ctaaaaccnt ctnctnnang gttagacngg acctctcttc tcccttcccg aanaatnaag 600 
tgtgngaaga nanccncncn cccccctncn tncnncctng ccngctnnnc cncntgtngg 660 
gggngccgcc cccgcggggg gacccccccn ttttcccc 698 

<210> 6 

<211> 740 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc feature 

<222> 82, 406, 426, 434, 462, 536, 551, 558, 563, 567, 582, 584, 
592, 638, 651, 660, 664, 673, 675, 697, 706, 711, 715, 716, 
717, 723, 724, 725, 733 
<223> n - A,T,C or G 

<400> 6 

actagtcaaa aatgctaaaa taatttggga gaaaatattt tttaagtagt gttatagttt 60 
catgtttatc ttttattatg tnttgtgaag ttgtgtcttt tcactaatta cctatactat 120 
gccaatattt ccttatatct atccataaca tttatactac atttgtaaga gaatatgcac 180 
gtgaaactta acactttata aggtaaaaat gaggtttcca agatttaata atctgatcaa 240 
gttcttgtta tttccaaata gaatggactt ggtctgttaa ggggctaagg gagaagaaga 300 
agataaggtt aaaagttgtt aatgaccaaa cattctaaaa gaaatgcaaa aaaaaattta 360 
ttttcaagcc ttcgaactat ttaaggaaag caaaatcatt tcctanatgc atatcatttg 420 
tgagantttc tcantaatat cctgaatcat tcatttcagc tnaggcttca tgttgactcg 480 
atatgtcatc tagggaaagt ctatttcatg gtccaaacct gttgccatag ttggtnaggc 540 
tttcctttaa ntgtgaanta ttnacangaa attttctctt tnanagttct tnatagggtt 600 
aggggtgtgg gaaaagcttc taacaatctg tagtgttncg tgttatctgt ncagaaccan 660 
aatnacggat cgnangaagg actgggtcta tttacangaa cgaatnatct ngttnnntgt 720 
gtnnncaact ccngggagcc 74 q 

<210> 7 

<211> 670 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc feature 

£222> 265, 268, 457, 470, 485, 546, 553, 566, 590, 596, 613, 624, 
639, 653, 659, 661 
<223> n = A,T,C or G 

<400> 7 

gctggggagc tcggcatggc ggtccccgct gcagccatgg ggccctcggc gttgggccag 60 
agcggccccg gctcgatggc cccgtggtgc tcagtgagca gcggcccgtc gcgctacgtg 120 
cttgggatgc aggagctgtt ccggggccac agcaagaccg cgagttcctg gcgcacagcg 180 
ccaaggtgca ctcggtggcc tggagttgcg acgggcgtcg cctacctcgg ggtcttcgac 240 
aagacgccac gtcttcttgc tgganaanga ccgttggtca aagaaaacaa ttatcgggga 300 
catggggata gtgtggacca ctttgttggc atccaagtaa tcctgaccta tttgttacgg 360 
cgtctggaga taaaaccatt cgcatctggg atgtgaggac tacaaaatgc attgccactg 420 
tgaacactaa aggggagaac attaatatct gctggantcc tgatgggcan accattgctg 480 
tagcnacaag gatgatgtgg tgactttatt gatgccaaga aaccccgttc caaagcaaaa 540 
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aaacanttcc aanttcgaag tcaccnaaat ctcctggaac aatgaacatn aatatnttct 600 
tcctgacaat ggnccttggg tgtntcacat cctcagctnc cccaaaactg aancctgtnc 660 
natccacccc ' 670 

<210> 8 

<211> 689 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 253, 335, 410, 428, 448, 458, 466, 479, 480, 482, 483, 485, 
488, 491, 492, 495, 499, 500, 502, 503, 512, 516, 524, 525, 
526, 527, 530, 540, 546, 550, 581, 593, 594, 601, 606, 609, 
610, 620*, 621, 622, 628, 641, 646, 656, 673 
<223> n = A, T, C or G 

<400> 8 

actagtatct aggaatgaac agtaaaagag gagcagttgg ctacttgatt acaacagagt 60 
aaatgaagta ctggatttgg gaaaacctgg ttttattaga acatatggaa tgaaagccta 120 
cacctagcat tgcctactta gccccctgaa ttaacagagc ccaattgaga caaacccctg 180 
gcaacaggaa attcaaggga gaaaaagtaa gcaacttggg ctaggatgag ctgactccct 240 
tagagcaaag ganagacagc ccccattacc aaataccatt tttgcctggg gcttgtgcag 300 
ctggcagtgt tcctgcccca gcatggcacc ttatngtttt gatagcaact tcgttgaatt 360 
ttcaccaact tattacttga aattataata tagcctgtcc gtttgctgtn tccaggctgt 420 
gatatatntt cctagtggtt tgactttnaa aataaatnag gtttantttt ctccccccnn 480 
cnntnctncc nntcnctcnn cnntcccccc cnctcngtcc tccnnnnttn gggggggccn 540 
cccccncggn ggacccccct ttggtccctt agtggaggtt natggcccct ggnnttatcc 600 
nggccntann tttccccgtn nnaaatgntt ccccctccca ntcccnccac ctcaanccgg 660 
aagcctaagt ttntaccctg ggggtcccc * 689 

<210> 9 
<211> 674 
<212> DNA 

<213> Homo sapiens , 
<220> 

<221> misc_feature 
<222> 602, 632, 639, 668 
<223> n = A,T,C or G 

<400> 9 

gtccactctc ctttgagtgt actgtcttac tgtgcactct gtttttcaac tttctagata 60 
taaaaaatgc ttgttctata gtggagtaag agctcacaca cccaaggcag caagataact 120 
gaaaaaagcg aggctttttt gccaccttgg taaaggccag ttcactgcta tagaactgct 180 
ataagcctga agggaagtag ctatgagact ttccattttt cttagttctc ccaataggct 240 
ccttcatgga aaaaggcttc ctgtaataat tttcacctaa tgaattagca gtgtgattat 300 
ttctgaaata agagacaaat tgggccgcag agtcttcctg tgatttaaaa taaacaaccc 360 
aaagttttgt ttggtcttca ccaaaggaca tactctaggg ggtatgttgt tgaagacatt 420 
caaaaacatt agctgttctg tctttcaatt tcaagttatt ttggagactg cctccatgtg 480 
agttaattac tttgctctgg aactagcatt attgtcatta tcatcacatt ctgtcatcat 540 
catctgaata atattgtgga tttccccctc tgcttgcatc ttcttttgac tcctctggga 600 
anaaatgtca aaaaaaaagg tcgatctact cngcaaggnc catctaatca ctgcgctgga 660 
aggacccnct gccc 674 

<210> 10 
<211> 346 
<212> DNA 
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<213> Homo sapiens 
; <220> 

<221> misc_feature 

<222> 320, 321, 322, 325, 326, 328, 329, 330, 332, 333, 334, 335, 
<223> n = A,T,C or G 
<400> 10 

actagtctgc tgatagaaag cactatacat cctattgttt ctttctttcc aaaatcagcc 60 
ttctgtctgt aacaaaaatg tactttatag agatggagga aaaggtctaa tactacatag 120 
ccttaagtgt ttctgtcatt gttcaagtgt attttctgta acagaaacat atttggaatg 180 
tttttctttt ccccttataa attgtaattc ctgaaatact gctgctttaa aaagtcccac 240 
tgtcagatta tattatctaa caattgaata ttgtaaatat acttgtctta cctctcaata 300 
aaagggtact tttctattan nnagnngnnn gnnnnataaa anaaaa 346 

<210> 11 

<211> 602 

<212> DNA 

<213> Homo sapiens 

<400> 11 

actagtaaaa agcagcattg ccaaataatc cctaattttc cactaaaaat ataatgaaat 60 
gatgttaagc tttttgaaaa gtttaggtta aacctactgt tgttagatta atgtatttgt 120 
tgcttccctt tatctggaat gtggcattag cttttttatt ttaaccctct ttaattctta 180 
ttcaattcca tgacttaagg ttggagagct aaacactggg atttttggat aacagactga 240 
cagttttgca taattataat cggcattgta catagaaagg atatggctac cttttgttaa 300 
atctgcactt tctaaatatc aaaaaaggga aatgaagtta taaatcaatt tttgtataat 360 
ctgtttgaaa catgagtttt atttgcttaa tattagggct ttgccccttt tctgtaagtc 420 
tcttgggatc ctgtgtagaa ctgttctcat taaacaccaa acagttaagt ccattctctg 480 
gtactagcta caaattcggt ttcatattct acttaacaat ttaaataaac tgaaatattt 540 
ctagatggtc tacttctgtt catataaaaa caaaacttga tttccaaaaa aaaaaaaaaa 600 
aa 602 
<210> 12 

<211> 685 1 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> miscfeature 

<222> 170, 279, 318, 321, 322, 422, 450, 453, 459, 467, 468, 470, 
473, 475, 482, 485, 486, 491, 498, 503, 506, 509, 522, 526 
527, 528, 538, 542, 544, 551, 567, 568, 569, 574, 576, 582, 
587, 588, 589, 590, 592, 593, 598, 599, 603, 605, 608 
<223> n = A,T,C or G 

<221> misc_feature 

<222> 633, 634, 635, 644, 646, 648, 651, 655, 660, 662, 663, 672, 
o/4, 675, 682, 683 
<223> n = A,T,C or G 

<400> 12 

actagtcctg tgaaagtaca actgaaggca gaaagtgtta ggattttgca tctaatgttc 60 
attatcatgg tattgatgga cctaagaaaa taaaaattag actaagcccc caaataagct 120 
f 03 ^!" St**^*** tagtagattt gaatatatag atgtagtatn ttgggtatct 180 
aggtgtttta tcattatgta aaggaattaa agtaaaggac tttgtagttg tttttattaa 240 
atatgcatat agtagagtgc aaaaatatag caaaaatana aactaaaggt agaaaagcat 300 
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tttagatatg ccttaatnta nnaactgtgc caggtggccc tcggaataga tgccaggcag 360 
agaccagtgc ctgggtggtg cctccccttg tctgcccccc tgaagaactt ccctcacgtg 420 
^angtagtgcc ctcgtaggtg tcacgtggan tantggganc aggccgnncn gtnanaagaa 480 
ancanngtga nagtttcncc gtngangcng aactgtccct gngccnnnac gctcccanaa 540 
cntntccaat ngacaatcga gtttccnrmc tccngnaacc tngccgnnnn cnngcccnnc 600 
cantntgnta accccgcgcc cggatcgctc tcnnntcgtt ctcncncnaa ngggntttcn 660 
cnnccgccgt cncnnccccg cnncc 685 

<210> 13 

<211> 694 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 503, 546, 599, 611, 636, 641, 643, 645, 656, 658, 662, 676, 
679, 687 

<223> n = A,T,CorG 
<400> 13 

cactagtcac tcattagcgt tttcaatagg gctcttaagt ccagtagatt acgggtagtc 60 

agttgacgaa gatctggttt acaagaacta attaaatgtt tcattgcatt tttgtaagaa 120 

cagaataatt ttataaaatg tttgtagttt ataattgccg aaaataattt aaagacactt 180 

tttctctgtg tgtgcaaatg tgtgtttgtg atccattttt tttttttttt taggacacct 240 

gtttactagc tagctttaca atatgccaaa aaaggatttc tccctgaccc catccgtggt 300 

tcaccctctt ttccccccat gctttttgcc ctagtttata acaaaggaat gatgatgatt 360 

taaaaagtag ttctgtatct tcagtatctt ggtcttccag aaccctctgg ttgggaaggg 420 

gatcattttt tactggtcat ttccctttgg agtgtactac tttaacagat ggaaagaact 480 

cattggccat ggaaacagcc gangtgttgg gagccagcag tgcatggcac cgtccggcat 540 

- ctggcntgat tggtctggct gccgtcattg tcagcacagt gccatgggac atggggaana 600 

ctgactgcac ngccaatggt tttcatgaag aatacngcat ncncngtgat cacgtnancc 660 

angacgctat gggggncana gggccanttg cttc 694 

<210> 14 

<211> 679 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 29, 68, 83, 87, 94, 104, 117, 142, 145, 151, 187, 201, 211, 
226, 229, 239, 241, 245, 252, 255, 259, 303, 309, 359, 387, 
400, 441, 446, 461, 492, 504, 505, 512, 525, 527, 533, 574, 
592, 609, 610, 618, 620, 626, 627, 633, 639, 645, 654 
<223> n = A, T, C or G 

<400> 14 

cagccgcctg catctgtatc cagcgccang tcccgccagt cccagctgcg cgcgcccccc 60 
agtcccgnac ccgttcggcc cangctnagt tagncctcac catnccggtc aaaggangca 120 
ccaagtgcat caaatacctg cngtncggat ntaaattcat cttctggctt gccgggattg 180 
ctgtccntgc cattggacta nggctccgat ncgactctca gaccanganc a,tcttcganc 240 
naganactaa tnatnattnt tccagcttct acacaggagt ctatattctg atcggatccg 300 
gcnccctcnt gatgctggtg ggcttcctga gctgctgcgg ggctgtgcaa gagtcccant 360 
gcatgctggg actgttcttc ggcttcntct tggtgatatn cgcoattgaa atacctgcgg 420 
ccatctgggg atattccact ncgatnatgt gattaaggaa ntccacggag ttttacaagg 480 
acacgtacaa cnacctgaaa accnnggatg anccccaccg ggaancnctg aangccatcc 540 
actatgcgtt gaactgcaat ggtttggctg gggnccttga acaatttaat cncatacatc 600 
tggccccann aaaggacntn ctcganncct tcnccgtgna attcngttct gatnccatca 660 
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cagaagtctc gaacaatcc 

4 <210> 15 
<211> 695 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 105, 172, 176, 179, 189, 203, 212, 219, 221, 229, 231, 238, 
242, 2.61, 266, 270, 278, 285, 286, 298, 311, 324, 337, 350, 
363, 384, 391, 395, 405, 411, 424, 427, 443, 448, 453, 455, 
458, 463, 467, 470, 479, 482, 484, 493, 499, 505, 518 
<223> n = A, T, C or G 

<221> miscjEeature 

<222> 520, 523, .531, 540, 584, 595, 597, 609, 611, 626, 628, 651, 
652, 657, 661, 665, 669, 672, 681, 683, 691, 693 
<223> n = A, T, C or G 



<400> 15 

actagtggat aaaggccagg gatgctgctc aacctcctac catgtacagg gacgtctccc 60 
cattacaact acccaatccg aagtgtcaac tgtgtcagga ctaanaaacc ctggttttga 120 
ttaaaaaagg gcctgaaaaa aggggagcca caaatctgtc tgcttcctca cnttantcnt 180 
tggcaaatna gcattctgtc tcnttggctg cngcctcanc ncaaaaaanc ngaactcnat 240 
cnggcccagg aatacatctc ncaatnaacn aaattganca aggcnntggg aaatgccnga 300 
tgggattatc ntccgcttgt tgancttcta agtttcnttc ccttcattcn accctgccag 360 
ccnagttctg ttagaaaaat gccngaattc naacnccggt tttcntactc ngaatttaga 420 
tctncanaaa cttcctggcc acnattcnaa ttnanggnca cgnacanatn ccttccatna 480 
ancncacccc acntttgana gccangacaa tgactgcntn aantgaaggc ntgaaggaan 540 
aactttgaaa ggaaaaaaaa ctttgtttcc ggccccttcc aacncttctg tgttnancac 600 
tgccttctng naaccctgga agcccngnga cagtgttaca tgttgttcta nnaaacngac 660 
ncttnaatnt cnatcttccc nanaacgatt ncncc 695 

<210> 16 

<211> 669 . ' 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc feature 

<222> 299, 354, 483, 555, 571, 573, 577, 642, 651, 662, 667 
<223> n - A,T,C or G 



<400> 16 

cgccgaagca gcagcgcagg ttgtccccgt 
ttcccgggcc ccttacactc cacagtcccg 
agaaccctgc ggaggagacc ggcgaggaga 
tgcctgagag agctgaagag gcaaagctaa 
ctggaggctc cgacttcctc atgaagagac 
gagactacaa catggccaaa gccaacatga 
acaagaacct ggtgactggt gatcacatcc 
ctcgctcgtc accagcaagc ttgcgggtgg 
canatctgag acgcttccct ccctgcccca 
tgcttttgca gccangggtc aggaagtggc 
cctgttggtg tcccacccat ggagcccctg 
tntcttncc 



ttcccctccc ccttcccttc tccggttgcc 60 
gtcccgccat gtcccagaaa caagaagaag 120 
agcaggacac gcaggagaaa gaaggtattc 180 
aggccaaata cccaagccta ggacaaaagc 240 
tccagaaagg gcaaaagtac tttgactcng 300 
agaataagca gctgccaagt gpangaccag 360 
ccaccccaca ggatctgccc agagaaagtc 420 
ccaagttgaa tgatgctgcc ggggctctgc 480 
cccgggtcct gtgctggctc ctgcccttcc 540 
ncnggtngtg gctggaaagc aaaacccttt 600 
gggcgagccc angaacttga ncctttttgt 660 

669 
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<210> 17 

<211> 697 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 33, 48, 50, 55, 59, 60, 76, 77, 78, 90, 113, 118, 130, 135, 
141, 143, 150, 156, 166, 167, 170, 172, 180, 181, 190, 192, 
194, 199, 201, 209, 212, 224, 225, 226, 230, 233, 234, 236, 
242, 244, 251, 253, 256, 268, 297, 305, 308, 311, 314 
<223> n - A,T,C or G 

<221> misc_feature 

<222> 315, 317, 322, 324, 327, 333, 337, 343, 362, 364, 367, 368, 
373, 384, 388, 394, 406, 411, 413, 423, 429, 438, 449, 450, 
473, 476, 479, 489, 491, 494, 499, 505, 507, 508, 522, 523, 
527, 530, 533, 535, 538, 539, 545, 548, 550, 552, 555 
<223> n = A, T, C or G 

<221> misc_feature 

<222> 562, 563, 566, 568, 572, 577, 578, 580, 581, .591, 594, 622, 
628, 632, 638, 642, 644, 653, 658, 662, 663, 665, 669, 675, 
680, 686, 689 
<223> n = A,T,C or G 

<400> 17 

gcaagatatg gacaactaag tgagaaggta atnctctact gctctagntn ctccnggcnn 60 
gacgcgctga ggagannnac gctggcccan ctgccggcca cacacgggga tcntggtnat 120 
gcctgcccan gggancccca ncnctcggan cccatntcac acccgnnccn tncgcccacn 180 
ncctggctcn cncngcccng nccagctcnc gnccccctcc gccnnnctcn ttnncntctc 240 
cncnccctcc ncnacnacct cctacccncg gctccctccc cagccccccc ccgcaancct 300 
ccacnacncc ntcnncncga ancnccnctc gcnctcngcc ccngccccct gccccccgcc 360 
cncnacnncg cgntcccccg cgcncgcngc ctcnccccct cccacnacag ncncacccgc 420 
agncacgcnc tccgcccnct gacgccccnn cccgccgcgc? .tcaccttcat ggnccnacng 480 
ccccgctcnc nccnctgcnc gccgncnngg cgccccgccc cnnccgngtn ccncncgnng 540 
ccccngcngn angcngtgcg cnncangncc gngccgnncn ncaccctccg nccnccgccc 600 
cgcccgctgg gggctcccgc cncgcggntc antccccncc cntncgccca ctntccgntc 660 
cnncnctcnc gctcngcgcn cgcccnccnc ccccccc 697 

<210> 18 
<211> 670 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 234, 292, 329, 437, 458, 478, 487, 524, 542, 549, 55P, 557, 
576, 597, 603, 604, 646, 665 
<223> n = A,T,C or G 

i 

<400> 18 

ctcgtgtgaa gggtgcagta cctaagccgg agcggggtag aggcgggccg gcaccccctt 60 
ctgacctcca gtgccgccgg cctcaagatc agacatggcc cagaacttga acgacttggc 120 
gggacggctg cccgccgggc cccggggcat gggcacggcc ctgaagctgt tgctgggggc 180 
cggcgccgtg gcctacggtg tgcgcgaatc tgtgttcacc gtggaaggcg ggcncagagc 240 
catcttcttc aatcggatcg gtggagtgca caggacacta tcctgggccg anggccttca 300 
cttcaggatc cttggttcca gtaccccanc atctatgaca ttcgggccag acctcgaaaa 360 
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aatctcctcc ctacaggctc caaagaccta 
tctcgaccaa tgctcangaa cttcctaaca 
<gaacgantgt tgccgtccat tgtcacgaag 
gncctcacnn ctgatcnccc agcggggcca 
acnnaaaagg gccaaggact tcccctcatc 
tttanccacc 

<210> 19 

<211> 606 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> 506 

<223> n - A,T,C or G 



cagatggtga atatctccct gcgagtgttg 420 
tgttccancg cctaagggct ggactacnaa 4 80 
tgctcaagaa tttnggtggc caagttcaat 540 
agttanccct ggttgatccc cgggganctg 600 
ctggataatg tggccntcac aaagctcaac 660 

670 



<400> 19 

actagtgcca acctcagctc ccaggccagt tctctgaatg tcgaggagtt ccaggatctc 60 
tggcctcagt tgtccttggt tattgatggg ggacaaattg gggatggcca gagccccgag 120 
tgtcgccttg gctcaactgt ggttgatttg tctgtgcccg gaaagtttgg catcattcgt 180 
ccaggctgtg ccctggaaag tactacagcc atcctccaac agaagtacgg actgctcccc 240 
tcacatgcgt cctacctgtg aaactctggg aagcaggaag . gcccaagacc tggtgctgga 300 
tactatgtgt ctgtccactg acgactgtca aggcctcatt tgcagaggcc accggagcta 360 
gggcactagc ctgactttta aggcagtgtg tctttctgag cactgtagac caagcccttg 420 
gagctgctgg tttagccttg cacctgggga aaggatgtat ttatttgtat tttcatatat 480 
cagccaaaag ctgaatggaa aagttnagaa cattcctagg tggccttatt ctaataagtt 540 
tcttctgtct gttttgtttt tcaattgaaa agttattaaa taacagattt agaatctagt 600 
gagacc 606 

<210> 20 

<211> 449 

<212> DNA 

<213> Homo sapiens 

<400> 20 « 

actagtaaac aacagcagca gaaacatcag tatcagcagc gtcgccagca ggagaatatg 60 
cagcgccaga gccgaggaga acccccgctc cctgaggagg acctgtccaa actcttcaaa 120 
ccaccacagc cgcctgccag gatggactcg ctgctcattg caggccagat aaacacttac 180 
tgccagaaca tcaaggagtt cactgcccaa aacttaggca agctcttcat ggcccaggct 240 
cttcaagaat acaacaacta agaaaaggaa gtttccagaa aagaagttaa catgaactct 300 
tgaagtcaca ccagggcaac tcttggaaga aatatatttg catattgaaa agcacagagg 360 
atttctttag tgtcattgcc gattttggct ataacagtgt ctttctagcc ataataaaat 420 
aaaacaaaat cttgactgct tgctcaaaa 449 

<210> 21 

<211> 409 

<212> DNA 

<213> Homo sapiens 

<400> 21 

tatcaatcaa ctggtgaata attaaacaat gtgtggtgtg atcatacaaa gggtaccact 60 
caatgataaa aggaacaagc tgcctatatg tggaacaaca tggatgcatt tcagaaactt 120 
tatgttgagt gaaagaacaa acacggagaa catactatgt ggttctcttt atgtaacatt 180 
acagaaataa aaacagaggc aaccaccttt gaggcagtat ggagtgagat agactggaaa 240 
aaggaaggaa ggaaactcta cgctgatgga aatgtctgtg tcttcattgg gtggtagtta 300 
tgtggggata tacatttgtc aaaatttatt gaactatata ctaaagaact ctgcatttta 360 
ttgggatgta aataatacct caattaaaaa gacaaaaaaa aaaaaaaaa 409 
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<210> 22 
<211> 649 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 263, 353, 610, 635, 646 

<223> n = A,T,C or G 



<400> 22 

acaattttca ttatcttaag cacattgtac 
tgataaggat ggtacttgca tatggtgaat 
tatttcagtg gaccaacatt gtggcatggc 
caaatctaca agagaccctg gttggttttt 
tcctgaatca gcagggatgg aangagggta 
agctctgaag tgtcacattt aatatcagtt 
aagagagaag aaagaggaag tgttcacttt 
ttatatcagt agttctgagg tattgatagc 
gttgaagcag ggtgaataac taggggcata 
gatgttttct ttggaatttc cggataagtt 
ctgaagttcn tatccatctc attacaacaa 

<210> 23 

<211> 669 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 642, 661 
<223> n = A, T,C or G 



atttctacag aacctgtgat tattctcgca 60 
tact act gtt gacagtttcc gcagaaatcc 120 
agcaaatgcc aacattttgt ggaatagcag 180 
cgttttgttt tctttgtttt ttcccccttc 240 
gggaagttat gaattactcc ittccagtagt 300 
ttttttaaac atgattctag ttnaatgtag 360 
tttaatacac tgatttagaa atttgatgtc 420 
ttgctttatt tctgccttta cgttgacagt 480 
tatatttttt ttttttgtaa gctgtttcat 540 
caggaaaaca tctgcatgtt gttatctagt 600 
aaacncccag aacggnttg 649 



<400> 23 

actagtgccg tactggctga aatccctgca 
tactctcagt caccagctct ggaattagat 
tatcctctga cagcctttgg gctgcctcgg 
tcacctgtcg tgcccccctc tgtcaagact 
cgcaaggtgg tgctgatgca gtgcaacatt 
ctgacacttc tgctgaagtt ggaggacaaa 
ccaaatgaga atatccccga gttggcggct 
gctgaccaga gccggttgac ttctctgcta 
ggaacagtac cctcaactca gccgctgtca 
gccctgatct gcgctgtggc tgtcctggac 
agtattacct gtgaagccct tccctccttt 
nttctaacc 

t <210> 24 
<211> 442 
<212> DNA 
<213> Homo sapiens 



ggaccaggaa gagaaccagt tcagactttg 60 
aaattccttg aagatgtcag gaatgggatc 120 
ccccagcagc cacagcagga ggaggtgaca 180 
ccgacacctg aaccagctga ggtggagact 240 
gagtcggtgg aggagggagt caaacaccac 300 
ctgaaccggc acctgagctg tgacctgatg 360 
gagctggtgc agctgggctt cattagtgag 420 
gaagagactt gaacaagttc aattttgcca 480 
ccgtctcctc ttagagctca ctcgggccag 540 
gtgctgcacc ctctgtcctt ccccccagtc 600 
attattcagg anggctgggg gggctccttg 660 

669 



<400> 24 

actagtacca tcttgacaga ggatacatgc 
tcactgccat cattaagcat cagtttcaaa 
gatgactatc attattctag tcctttgaat 
cttacgatgc acttttctcc agcacatcag 



tcccaaaacg tttgttacca cacttaaaaa 60 
attatagcca ttcatgattt actttttcca 120 
ttgtaagggg aaaaaaaaca aaaacaaaaa 180 
atttcaaatt gaaaattaaa gacatgctat 240 
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ggtaatgcac ttgctagtac tacacacttt ggtacaacaa aaaacagagg caagaaacaa 300 

cggaaagaga aaagccttcc tttgttggcc cttaaactga gtcaagatct gaaatgtaga 360 

..gatgatctct gacgatacct gtatgttctt attgtgtaaa taaaattgct ggtatgaaat 420 

gacctaaaaa aaaaaaaaga aa 442 

<210> 25 

<211> 656 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 330, 342, 418, 548, 579, 608 

<223> n=A,T,CorG 

<400> 25 

tgcaagtacc acacactgtt tgaattttgc acaaaaagtg actgtaggat caggtgatag 60 
ccccggaatg tacagtgtct tggtgcacca agatgccttc taaaggctga cataccttgg 120 
accctaatgg ggcagagagt atagccctag cccagtggtg acatgaccac tccctttggg 180 
aggcctgagg tagaggggag tggtatgtgt tttctcagtg gaagcagcac atgagtgggt 240 
gacaggatgt tagataaagg ctctagttag ggtgtcattg tcatttgaga gactgacaca 300 
ctcctagcag ctggtaaagg ggtgctggan gccatggagg anctctagaa acattagcat 360 
gggctgatct gattacttcc tggcatcccg ctcactttta tgggaagtct tattagangg 420 
atgggacagt tttccatatc cttgctgtgg agctctggaa cactctctaa atttccctct 480 
attaaaaatc actgccctaa ctacacttcc tccttgaagg aatagaaatg gaactttctc 540 
tgacatantt cttggcatgg ggagccagcc acaaatgana atctgaacgt gtccaggttt 600 
ctcctganac tcatctacat agaattggtt aaaccctccc ttggaataag gaaaaa 656 

<210> 26 

<211> 434 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 395 < 
<223> n = A, T,C or G 

<400> 26 

actagttcag actgccacgc caaccccaga aaatacccca catgccagaa aagtgaagtc 60 
ctaggtgttt ccatctatgt ttcaatctgt ccatctacca ggcctcgcga taaaaacaaa 120 
acaaaaaaac gctgccaggt tttagaagca gttctggtct caaaaccatc aggatcctgc 180 
caccagggtt cttttgaaat agtaccacat gtaaaaggga atttggcttt cacttcatct 24 0 
aataactgaa ttgtcaggct ttgattgata attgtagaaa taagtagcct tctgttataq 300 
gaataagtta taatcagtat tcatctcttt gttttttgtc actcttttct ctctaattgt 360 
gtcatttgta ctgtttgaaa aatatttctt ctatnaaatt aaactaacct gccttaaaaa 420 
aaaaaaaaaa aaaa 

434 

<210> 27 
<211> 654 
<212> DNA 

<213> Homo sapiens ? 
<220> 

<221> mis cofeature 

<222> 505, 533, 563, 592, 613, 635, 638 
<223> n = A,T,C or G 
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<400> 27 

actagtccaa cacagtcaga aacattgttt 
taataaacca ggatccattt aggtaccact 
tttatactgc atcctttaca ttagccacta 
cagaatccta tggattgcag catttcactt 
gcagtttctc aaaagcagaa acatgccgcc 
gaatgtaagg gcagctggcc cccaatgtgg 
ttcttgttcg cggctaaatg acagtttctg 
gtgttgattt acaaagaggc cagctaatag 
attcaagctg tgagccaggc agganctcag 
ggtacaaaaa aaattttaaa gcntttatgt 
aattgttaag aanaatttta agtgtccaga 

<210> 28 

<211> 670 

<212> DNA 

<213> Homo sapiens 



tgaatcctct gtaaaccaag gcattaatct 60 
tgatataaaa aggatatcca taatgaatat 120 
aatacgttat tgcttgatga agacctttca 180 
ggctacttca tacccatgcc ttaaagaggg 240 
agttctcaag ttttcctcct aactccattt 300 
ggaggtccga acattttctg aattcccatt 360 
tcattactta gattccgatc tttcccaaag 420 
cagaaatcat gaccctgaaa gagagatgaa 480 
tatggcaaag gtcttgagaa tcngccattt 540 
tataccatgg aaccatagaa anggcaaggg 600 
cccanaanga aaaaaaaaaa aaaa 654 



<220> 

<221> mis cofeature 

<222> 101, 226, 274, 330, 385, 392, 397, 402, 452, 473, 476, 532, 
534, 538, 550, 583, 595, 604, 613, 622, 643, 669 
<223> n = A,T,C or G 



<400> 28 

cgtgtgcaca tactgggagg atttccacag ctgcacggtc acagccctta cggattgcca 60 

ggaaggggcg aaagatatgt gggataaact gagaaaagaa nccaaaaacc tcaacatcca 120 

aggcagctta ttcgaactct gcggcagcgg caacggggcg gcggggtccc tgctcccggc 180 

gttcccggtg ctcctggtgt ctctctcggc agctttagcg acctgncttt ccttctgagc 240 

gtggggccag ctccccccgc ggcgcccacc cacnctcact ccatgctccc ggaaatcgag 300 

aggaagatca ttagttcttt ggggacgttn gtgattctct gtgatgctga aaaacactca 360 

tatagggaat gtgggaaatc ctganctctt tnttatntcg tntgatttct tgtgttttat 420 

ttgccaaaat gttaccaatc agtgaccaac cnagcacagc caaaaatcgg acntcngctt 480 

tagtccgtct tcacacacag aataagaaaa cggcaaaccc accccacttt tnantttnat 540 

tattactaan ttttttctgt tgggcaaaag aatctcagga acngccctgg ggccnccgta 600 

ctanagttaa ccnagctagt tncatgaaaa atgatgggct ccncctcaat gggaaagcca 660 
agaaaaagnc 670 

<210> 29 

<211> 551 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 336, 474, 504, 511, 522, 523, 524, 540, 547 
<223> n - A,T,C or G 



<400> 29 

actagtcctc cacagcctgt gaatccccct 
agatctcagc gtttagccac cttacccatg 
ccctctccag ccactgatgg gaaagtattc 
tcagtaccag aggtgcctga tgttgcacat 
cctcttgact taagtcgtgg ttcagaagtt 
cgtaatgaat gtcccagggc agaaaaagag 
aaagcaatag ctgatgggaa gaggagctcc 
aaaagtgaaa ttgggaagac aaaagctcaa 
aggaaggaag agagaagaga gacnaagatc 



agacctttca agcatagtga gcggagaaga 60 
cctgatgatt ctgtagaaaa ggtttcttct 120 
tccatcagtt ctcaaaatca ^caagaatct 180 
ttgccacttg agaagctggg accctgtctc 240 
acagcaccgg tagcctcaga ttcctcttac 300 
gatacncaga tgcttccaaa tccttcttcc 360 
agcagcagca ggaatatcga aaacagaaaa 420 
cagcatttgg taaggagaaa aganaagatg 480 
nctacggacc gnnncggaag aagaagaagn 540 
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aaaaaanaaa a 552 

^<210> 30 
<211> 684 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 545, 570, 606, 657, 684 

<223> n = A,T,C or G 

<400> 30 

actagttcta tctggaaaaa gcccgggttg gaagaagctg tggagagtgc gtgtgcaatg 60 
cgagactcat ttcttggaag catccctggc aaaaatgcag ctgagtacaa ggttatcact 120 
gtgatagaac ctggactgct ttttgagata atagagatgc tgcagtctga agagacttcc 180 
agcacctctc agttgaatga attaatgatg gcttctgagt caactttact ggctcaggaa 240 
ccacgagaga tgactgcaga tgtaatcgag cttaaaggga aattcctcat caacttagaa 300 
ggtggtgata ttcgtgaaga gtcttcctat aaagtaattg tcatgccgac tacgaaagaa 360 
aaatgccccc gttgttggaa gtatacagcg ggagtcttca gatacactgt gtcctcgatg 420 
tgcagaagtt gtcagtggga aaatagtatt aacagctcac tcgagcaaga accctcctga 480 
cagtactggg ctagaagttt ggatggatta tttacaatat aggaaagaaa gccaagaatt 540 
aggtnatgag tggatgagta aatggtggan gatggggaat tcaaatcaga attatggaag 600 
aagttnttcc tgttactata gaaaggaatt atgtttattt acatgcagaa aatatanatg 660 
tgtggtgtgt accgtggatg gaan 684 

<210> 31 

<211> 654 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 326, 582, 651 
<223> n = A,T,C or G 

<400> 31 ' 
gcgcagaaaa ggaaccaata tttcagaaac aagcttaata ggaacagctg cctgtacatc 60 
aacatcttct cagaatgacc cagaagttat catcgtggga gctggcgtgc ttggctctgc 120 
tttggcagct gtgctttcca gagatggaag aaaggtgaca gtcattgaga gagacttaaa 180 
agagcctgac agaatagttg gagaattcct gcagccgggt ggttatcatg ttctcaaaga 240 
ccttggtctt ggagatacag tggaaggtct tgatgcccag gttgtaaatg gttacatgat 300 
tcatgatcag ggaaagcaaa tcagangttc agattcctta ccctctgtca gaaaacaatc 360 
aagtgcagag tggaagagct ttccatcacg gaagattcat catgagtctc cggaaagcag 420 
ctatggcaga gcccaatgca aagtttattg aaggtgttgt gttacagtta ttagaggaag 480 
atgatgttgt gatgggagtt cagtacaagg ataaagagac tgggagatat caaggaactc 540 
catgctccac tgactgttgt tgcagatggg cttttctcca anttcaggaa aagcctggtc 600 
tcaataaagt ttctgtatca ctcatttggt tggcttctta tgaagaatgc nccc 654 

<210> 32 

<211> 673 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 376, 545, 627 
<223> n = A, T, C or G 
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<400> 32 

lactagtgaag aaaaagaaat tctgatacgg gacaaaaatg ctcttcaaaa catcattctt 60 
tatcacctga caccaggagt tttcattgga aaaggatttg aacctggtgt tactaacatt 120 
ttaaagacca cacaaggaag caaaatcttt ctgaaagaag taaatgatac acttctggtg 180 
aatgaattga aatcaaaaga atctgacatc atgacaacaa atggtgtaat tcatgttgta 240 
gataaactcc tctatccagc agacacacct gttggaaatg atcaactgct ggaaatactt 300 
aataaattaa tcaaatacat ccaaattaag tttgttcgtg gtagcacctt caaagaaatc 360 
cccgtgactg tctatnagcc aattattaaa aaatacacca aaatcattga tgggagtgcc 420 
tgtgggaaat aactgaaaaa gagaccgaga agaacgaatc attacaggtc ctgaaataaa 480 
atacctagga tttctactgg aggtggagaa acagaagaac tctgaagaaa ttgttacaag 540 
aagangtccc aaggtcacca aattcattga aggtggtgat ggtctttatt tgaagatgaa 600 
gaaattaaaa gacgcttcag ggagacnccc catgaaggaa ttgccagcca caaaaaaatt 660 
cagggat tag aaa 67 3 

<210> 33 

<211> 673 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 325, 419, 452, 532, 538, 542, 571, 600, 616, 651, 653, 672 
<223> n = A,T,C or G 



<400> 33 

actagttatt tactttcctc cgcttcagaa 
ggatctgttg tttcttttgg gtctcacctc 
gaaggttgaa aggagcaggg aaaagatcca 
tcttgaagta tgatgcatat tgcattattt 
atcatttaga agggcaagtt caagaggata 
tgactaaaaa tgaacattaa tgttnaagac 
tgaaattatg caactttgat atcatattcc 
gaaactttat aaagcatatg gtcagttatt 
ctgcacttaa agaagtctaa cagtacaaat 
tntattttta aatattgtac tatttatggt 
aatttatcat ttcaanggca ttctatttgg 
ttcgctactg tnt 



ggtttttcag actgagagcc taagcatact 60 
atcagtgtgc atagtggcag aaa tt at aaa 120 
gaagcatgtt agttcgacat catcatcttt 180 
tatttgcaaa ctaggaattg cagtctgagg 240 
tgaagatttg agaacttttt aactattcat 300 
ttaagacttt aacctgctgg cagtcccaaa 360 
ttgatttaaa ttgggctttt gtgattgant 420 
tnattaaaaa ggcaaaacct gaaccacctt 480 
acctatctat cttagatgga tntatttntt 540 
nggtggggct ttcttactaa tacacaaatn 600 
gtttagaagt tgattccaag nantgcatat 660 

673 



<210> 34 

<211> 684 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 414, 472, 480, 490, 503, 507, 508, 513, 523, 574, 575, 598, 

659, 662, 675 

<223> n = A,T,C or G 

<4 00> 34 , 
actagtttat tcaagaaaag aacttactga ttcctctgtt cctaaagcaa gagtggcagg 60 
tgatcagggc tggtgtagca tccggttcct ttagtgcagc taactgcatt tgtcactgat 120 
gaccaaggag gaaatcacta agacatttga gaagcagtgg tatgaacgtt cttggacaag 180 
ccacagttct gagccttaac cctgtagttt gcacacaaga acgagctcca cctccccttc 240 
ttcaggagga atctgtgcgg atagattggc tggacttttc aatggttctg ggttgcaagt 300 
gggcactgtt atggctgggt atggagcgga cagccccagg aatcagagcc tcagcccggc 360 
tgcctggttg gaaggtacag gtgttcagca ccttcggaaa aagggcataa agtngtgggg 420 
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gacaattctc agtccaagaa gaatgcattg accattgctg gctatttgct tncctagtan 480 
gaattggatn catttttgac cangatnntt ctnctatgct ttnttgcaat gaaatcaaat 540 
^cccgcattat ctacaagtgg tatgaagtcc tgcnnccccc agagaggctg ttcaggcnat 600 
gtcttccaag ggcagggtgg gttacaccat tttacctccc ctctcccccc agattatgna 660 
cncagaagga atttntttcc tccc $ 84 

<210> 35 

<211> 614 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 17, 20, 152, 223, 267, 287, 304, 306, 316, 319, 321, 355, 
365, 382, 391, 407, 419, 428, 434, 464, 467, 477, 480, 495, 
499, 505, 515, 516, 522, 524, 527, 542, 547, 549, 567, 572, 
576, 578 

<223> n = A,T,C or G 
<400> 35 

actagtccaa cgcgttngcn aatattcccc tggtagccta cttccttacc cccgaatatt 60 
ggtaagatcg agcaatggct tcaggacatg ggttctcttc tcctgtgatc attcaagtgc 120 
tcactgcatg aagactggct tgtctcagtg tntcaacctc accagggctg tctcttggtc 180 
cacacctcgc tccctgttag tgccgtatga cagcccccat canatgacct tggccaagtc 240 
acggtttctc tgtggtcaat gttggtnggc tgattggtgg aaagtanggt ggaccaaagg 300 
aagncncgtg agcagncanc nccagttctg caccagcagc gcctccgtcc tactngggtg 360 
ttccngtttc tcctggccct gngtgggcta nggcctgatt cgggaanatg cctttgcang 420 
gaaggganga taantgggat ctaccaattg attctggcaa aacnatntct aagattnttn 480 
tgctttatgt ggganacana tctanctctc atttnntgct gnanatnaca ccctactcgt 540 
gntcgancnc gtcttcgatt ttcgganaca cnccantnaa tactggcgtt ctgttgttaa 600 
aaaaaaaaaa . aaaa ^4 

<210> 36 

<211> 686 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 222, 224, 237, 264, 285, 548, 551, 628, 643, 645, 665, 674 
<223> n = A, T,C or G 

<400> 36 

gtggctggcc cggttctccg cttctcccca tcccctactt tcctccctcc ctccctttcc 60 
ctccctcgtc gactgttgct tgctggtcgc agactccctg acccctccct cacccctccc 120 
taacctcggt gccaccggat tgcccttctt ttcctgttgc ccagcccagc cctagtgtca 180 
gggcgggggc ctggagcagc ccgaggcact gcagcagaag ananaaaaga cacgacnaac 240 
ctcagctcgc cagtccggtc gctngcttcc cgccgcatgg caatnagaca gacgccgctc 300 
acctgctctg ggcacacgcg acccgtggtt gatttggcct tcagtggcat cacccttatg 360 
ggtatttctt aatcagcgct tgcaaagatg gttaacctat gctacgccag ggagatacag 420 
gagactggat tggaacattt ttggggtcta aaggtctgtt tggggtgcaa cactgaataa 480 
ggatgccacc aaagcagcta cagcagctgc agatttcaca gcccaagtgt gggatgctgt 540 
ctcagganat naattgataa cctggctcat aacacattgt caagaatgtg gatttcccca 600 
ggatattatt atttgtttac cggggganag gataactgtt tcncntattt taattgaaca 660 
aactnaaaca aaanctaagg aaatcc 59 ^ 

<210> 37 
<211> 681 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 7, 10, 11, 19, 25, 32, 46, 53, 77, 93, 101, 103, 109, 115, 
123, 128, 139, 157, 175, 180, 192, 193, 194, 212, 218, 226, 
227, 233, 240, 241, 259, 260, 267, 289, 296, 297, 298, 312, 
313, 314, 320, 325, 330, 337, 345, 346, 352, 353, 356 
<223> n « A,T,C or G 

<221> misc_f eat ure 

<222> 382, 385, 400, 427, 481, 484, 485, 491, 505, 515, 533, 542, 
544, 554, 557, 560, 561, 564, 575, 583, 589, 595, 607, 619, 
628, 634, 641, 645, 658, 670 
<223> n = A,T,C or G 

<400> 37 

gagacanacn naacgtcang agaanaaaag angcatggaa cacaanccag gcncgatggc 60 
caccttccca ccagcancca gcgcccccca gcngccccca ngnccggang accangactc 120 
cancctgnat caatctganc tctattcctg gcccatncct acctcggagg tggangccgn 180 
aaaggtcgca cnnncagaga agctgctgcc ancaccancc gccccnnccc tgncgggctn 240 
nataggaaac tggtgaccnn gctgcanaat tcatacagga gcacgcgang ggcacnnnct 300 
cacactgagt tnnngatgan gcctnaccan ggacctnccc cagcnnattg annacnggac 360 
tgcggaggaa ggaagacccc gnacnggatc ctggccggcn tgccaccccc ccacccctag 420 
gattatnccc cttgactgag tctctgaggg gctacccgaa cccgcctcca ttccctacca 480 
natnntgctc natcgggact gacangctgg ggatnggagg ggctatcccc cancatcccc 540 
tnanaccaac agcnacngan natnggggct ccccngggtc ggngcaacnc tcctncaccc 600 
cggcgcnggc cttcggtgnt gtcctccntc aacrxaattcc naaanggcgg gccccccngt 660 
ggactcctcn ttgttccctc c 681 

<210> 38 

<211> 687 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 3, 30, 132, 151, 203, 226, 228, 233, 252, 264, 279, 306, 
308, 320, 340, 347, 380, 407, 429, 437, 440, 445, 448, 491, 
559, 567, 586, 589, 593, 596, 603, 605, 606, 609, 626, 639, 
655, 674, 682 
<223> n = A,T,C or G 

<400> 38 

canaaaaaaa aaaacatggc cgaaaccagn aagctgcgcg atggcgccac ggcccctctt 60 
ctcccggcct gtgtccggaa ggtttccctc cgaggcgccc cggctcccgc aagcggagga 120 
gagggcggga cntgccgggg ccggagctca naggccctgg ggccgctctg ctctcccgcc 180 
atcgcaaggg cggcgctaac ctnaggcctc cccgcaaagg tccccnangc ggnggcggcg 240 
gggggctgtg anaaccgcaa aaanaacgct gggcgcgcng cgaacccgtc cacccccgcg 300 
aaggananac ttccacagan gcagcgtttc cacagcccan agccacnttt ctagggtgat 360 
gcaccccagt aagttcctgn cggggaagct caccgctgtc aaaaaanctc ttcgctccac 420 
cggcgcacna aggggangan ggcangangc tgccgcccgc acaggtcatc tgatcacgtc 480 
gcccgcccta ntctgctttt gtgaatctcc actttgttca accccacccg ccgttctctc 540 
ctccttgcgc cttcctctna ccttaanaac cagcttcctc tacccnatng tanttnctct 600 
gcncnngtng aaattaattc ggtccnccgg aacctcttnc ctgtggcaac tgctnaaaga 660 
aactgctgtt ctgnttactg cngtccc 687 
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<210> 39 
<211> 695 
<<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 300, 401, 423, 429, 431, 437, 443, 448, 454, 466, 492, 515, 
523, 524, 536, 538, 541, 552, 561, 566, 581, 583, 619, 635, 
636, 641, 649, 661, 694 
<223> n - A, T, C or G 

<400> 39 

actagtctgg cctacaatag tgtgattcat gtaggacttc tttcatcaat tcaaaacccc 60 
tagaaaaacg tatacagatt atataagtag ggataagatt tctaacattt ctgggctctc 120 
tgacccctgc gctagactgt ggaaagggag tattattata gtatacaaca ctgctgttgc 180 
cttattagtt ataacatgat aggtgctgaa ttgtgattca caatttaaaa acactgtaat 240 
ccaaactttt ttttttaact gtagatcatg catgtgaatg ttaatgttaa tttgttcaan 300 
gttgttatgg gtagaaaaaa ccacatgcct taaaatttta aaaagcaggg cccaaactta 360 
ttagtttaaa attaggggta tgtttccagt ttgttattaa ntggttatag ctctgtttag 420 
aanaaatcna ngaacangat ttngaaantt aagntgacat tatttnccag tgacttgtta 480 
atttgaaatc anacacggca ccttccgttt tggtnctatt ggnntttgaa tccaancngg 540 
ntccaaatct tnttggaaac ngtccnttta acttttttac nanatcttat ttttttattt 600 
tggaatggcc ctatttaang ttaaaagggg ggggnnccac naccattcnt gaataaaact 660 
naatatatat ccttggtccc ccaaaattta aggng ~ 695 

<210> 40 

<211> 674 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc feature 

<222> 403, 428, 432, 507, 530, 543, 580, 583, 591, 604, 608, 621, 
624, 626, 639, 672 
<223> n - A,T,C or G 

<400> 40 

actagtagtc agttgggagt ggttgctata ccttgacttc atttatatga atttccactt 60 
tattaaataa tagaaaagaa aatcccggtg cttgcagtag agttatagga cattctatgc 120 
ttacagaaaa tatagccatg attgaaatca aatagtaaag gctgttctgg ctttttatct 180 
tcttagctca tcttaaataa gtagtacact tgggatgcag tgcgtctgaa gtgctaatca 240 
gttgtaacaa tagcacaaat cgaacttagg atgtgtttct tctcttctgt gtttcgattt 300 
tgatcaattc tttaattttg ggaacctata atacagtttt cctattcttg gagataaaaa 360 
ttaaatggat cactgatatt taagtcattc tgcttctcat ctnaatattc catattctgt 420 
attagganaa antacctccc agcacagccc cctctcaaac cccacccaaa accaagcatt 480 
tggaatgagt ctcctttatt tccgaantgt ggatggtata acccatatcn ctccaatttc 540 
tgnttgggtt gggtattaat ttgaactgtg catgaaaagn ggnaatcttt nctttgggtc 600 
aaantttncc ggttaatttg nctngncaaa tccaatttnc tttaagggtg tctttataaa 660 
atttgctatt cngg "* 674 

<210> 41 i 

<211> 657 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc feature 
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<222> 243, 247, 251, 261, 267, 272, 298, 312, 315, 421, 432, 434, 
501, 524, 569, 594, 607, 650 
<223> n « A, T , C or G 



<400> 41 

gaaacatgca agtaccacac actgtttgaa 
gtgatagccc cggaatgtac agtgtcttgg 
accttgggac cctaatgggg cagagagtat 
cctttgggag gctgaagtta aagggaatgg 
atnggtnaca ngatgttaaa ntaaggntct 
acacactcct ancanctggt aaaggggtgc 
agcatgggct gatctgatta cttcctggca 
naaggatggg ananttttcc atatccttgc 
ccctctatta aaaatcactg nccttactac 
tttctctgac ttagttcttg gcatggganc 
ttctccngaa ctcacctact tgaattggta 

<210> 42* 
<211> 389 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 179, 317, 320 
<223> n = A, T, Cor G 



ttttgcacaa aaagtgactg tagggatcag 60 
tgcaccaaga tgccttctaa aggctgacat 120 
agccctagcc cagtggtgac atgaccactc 180 
tatgtgtttt ctcatggaag cagcacatga 240 
antttgggtg tcttgtcatt tgaaaaantg 300 
tggaagccat ggaagaactc taaaaacatt 360 
tcccgctcac ttttatggga agtcttatta 420 
tgttggaact ctggaacact ctctaaattt 480 
acttcctcct tganggaata gaaatggacc 540 
cagcccaaat taaaatctga cttntccggt 600 
aaacctcctt tggaattagn aaaaacc 657 



<400> 42 

actagtgctg aggaatgtaa acaagtttgc 
cgatagctca cactcctgca ctgtgcctgt 
caggaagaaa acaaaaacca gactgtgtcc 
ggccttcacc gccaccaggg tgtcccgcca 
atcctgaaga attcctgttt gggggttgtg 
tgttgcctgc ccgcgtngtn gggaagggac 
atattttaag ttaagaaaaa aaaaaaaaa 

<210> 43 
<211> 279 
<212> DNA 

<213> Homo sapiens 



tgggccttgc gagacttcac caggttgttt 60 
cacccaggaa tgtctttttt aattagaaga 120 
cacaatcaga aacctccgtt gtggcagang 180 
gacagggaga gactccagcc ttctgaggcc 240 
aaggaaaatc acccggattt aaaaagatgc 300 
tggtttcctg gtgaatttct taaaagaaaa 360 

389 



<400> 43 

actagtgaca agctcctggt cttgagatgt cttctcgtta aggagatggg ccttttggag 60 
gtaaaggata aaatgaatga gttctgtcat gattcactat tctagaactt gcatgacctt 120 
tactgtgtta gctctttgaa tgttcttgaa attttagact ttctttgtaa acaaataata 180 
tgtccttatc attgtataaa agctgttatg tgcaacagtg tggagatcct tgtctgattt 240 
aataaaatac ttaaacactg aaaaaaaaaa aaaaaaaaa 279 

<210> 44 

<211> 449 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 245, 256, 264, 266, 273, 281, 323, 325, 337, 393 
<223> n - A,T,C or G 
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<400> 44 

actagtagca tcttttctac aacgttaaaa 
^caacaacaac aataacaata aatcctaagt 
atatcagcct gttttttccc ttttttctcc 
tctacagcct ctttcctctt ctcatgcttg 
aagantgggc tgtttngctt ggantncggt 
gttggaagaa actcaaacct tcnancccta 
atttttgtac tggcattaac aaaaaaagaa 
aactttaaaa gggaaaaaaa aaaaaaaaa 

<210> 45 

<211> 559 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 263 

<223> n=A,T,CorG 



ttgcagaagt agcttatcat taaaaaacaa 60 
gtaaatcagt tattctaccc cctaccaagg 120 
tgggaataat tgtgggcttc ttcccaaatt 180 
agcttccctg tttgcacgca tgcgttgtgc 240 
ccnagtggaa ncatgctttc ccttgttact 300 
ggtgttncca ttttgtcaag tcatcactgt 360 
atnaaatatt gttccattaa actttaataa 420 

449 



<400> 45 

actagtgtgg gggaatcacg gacacttaaa gtcaatctgc gaaataattc ttttattaca 60 
cactcactga agtttttgag tcccagagag ccattctatg tcaaacattc caagtactct 120 
ttgagagccc agcattacat caacatgccc gtgcagttca aaccgaagtc cgcaggcaaa 180 
tttgaagctt tgcttgtcat tcaaacagat gaaggcaaga gtattgctat tcgactaatt 240 
ggtgaagctc ttggaaaaaa ttnactagaa tactttttgt gttaagttaa ttacataagt 



300 



tgtattttgt taactttatc tttctacact acaattatgc ttttgtatat atattttgta 360 
tgatggatat ctataattgt agattttgtt tttacaagct aatactgaag actcgactga 420 
aatattatgt atctagccca tagtattgta cttaactttt acagggtgaa aaaaaaattc 480 
tgtgtttgca ttgattatga tattctgaat aaatatggga atatatttta atgtgggtaa 540 
aaaaaaaaaa aaaaaggaa " J 5jg 

<210> 46 

<211> 731 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 270, 467, 477, 502, 635, 660, 671, 688, 695, 697, 725 
<223> n = A, T, C or G 

<400> 46 

actagttcta gtaccatggc tgtcatagat gcaaccatta tattccattt agtttcttcc 60 
tcaggttccc taacaattgt ttgaaactga atatatatgt ttatgtatgt gtgtgtgttc 120 
actgtcatgt atatggtgta tatgggatgt gtgcagtttt cagttatata tatattcata 180 
tatacatatg catatatatg tataatatac atatatacat gcatacactt gtataatata 240 
catatatata cacatatatg cacacatatn atcactgagt tccaaagtga gtctttattt 300 
ggggcaattg tattctctcc ctctgtctgc tcactgggcc tttgcaagac atagcaattg 360 
lit "f C ttfc gg ataa 9 agtcttatct tcggcactct tgactctagc cttaacttta 420 
gatttctatt ccagaatacc tctcatatct atcttaaaac ctaaganggg taaagangtc 480 
ataagattgt agtatgaaag antttgctta gttaaattat atctcaggaa actcattcat 540 
f^ 33 ^ aatt 9taaaa tgatggtttg ttgtatctga aaaaatgttt agaacaagaa 600 
atgtaactgg gtacctgtta tatcaaagaa cctcnattta ttaagtctcc tcatagccan 660 
ta° C ntt ngccctctct ^cctgantt aatananact tgaataatga atagttaatt 720 



<210> 47 
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<211> 640 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 5, 28, 106, 153, 158, 173, 176, 182, 189, 205, 210, 214, 
225, 226, 229, 237, 260, 263, 269, 277, 281, 282, 322, 337, 
338, 354, 365, 428, 441, 443, 456, 467, 476, 484, 503, 508, 
554, 567, 575, 579, 588, 601, 606, 609, 611, 621, 636 
<223> n = A,T,C or G 



<400> 47 

tgcgngccgg tttggccctt ctttgtanga 
cgttaataac tcctcaggtc cctgcctgca 
gtacaccaaa tgtgacatcc tttcaccaat 
anacgactnc aacaattttt tgatnacccn 
ggagcagcat ggacctgtcn gcnactaang 
ttggtatgtc ttactgaaag anagaaacat 
caganattgc caatgccaag tccgagcggt 
tacatacntt gtccccgaaa nanaagatgc 
acanctacac ctggtgcttg ganaacanac 
cccagtgggt tttnccttgg cacctanctt 
ntggcnttnt nttgggacca ntcttctcac 

<210> 48 
<211> 257 
<212> DNA 
<213> Homo 

<400> 48 

actagtatat gaaaatgtaa atatcacttg 
ccaccttgag cagccttgga aacctaacct 
tgattttctt tgttcctgaa aaagtgattt 
ttatatttgt atatgtatca tcataaaata 
aaaaaaaaaa aaaaaaa 

<210> 49 
<211> 652 
<212> DNA 
<213> Homo 

<220> 
<221> misc_feature 
<222> 410, 428, 496, 571, 647 
<223> n ■» A,T,C or G 

<400> 49 

actagttcag atgagtggct gctgaagggg 
tccacttatt tgaactctta agtcataaat 
gttgacacta gaaactgccc atttctgtat 
tggggaaaaa aatcttattt taaaatggct 
taaacttctt tctgtttcca aaacttgaaa 
ttttcaaagc tttcctcaca tttttaaagt 
ttctttaaag cagctatatc ccaacccatg 
aacagcangg ttattgaagc agctttctca 
tttattgtat ttgtanaata caatttttgt 
gatgcttttc atatagagtg aaatatccca 



cactttcatc cgccctgaaa tcttcccgat 60 
cagggttttt tcttantttg ttgcctaaca 120 
atngattnct tcataccaca tcntcnatgg 180 
aaanactggg ggctnnaana agtacantct 240 
gaacaanagt nntgaacatt tacacaacct 300 
gcttctnncc ctagaccacg aggncaaccg 360 
tagatcaggt aatacattcc atggatgcat 420 
cctaanggct tcttcanact ggtccngaaa 480 
tctttggaag atcatctggc acaagttccc 540 
accanatcna ttcggaancc attctttgcc 600 
aactgnaccc 640 



tgtactcaaa caaaagttgg tcttaagctt 60 
gcctctttta gcataatcac attttctaaa 120 
gtattagttt tacatttgtt ttttggaaga 180 
tttaaataaa aagtatcttt agagtgaaaa 240 

257 



cccccttgtc attttcatta taacccaatt 60 
gtataatgac ttatgaatta gcacagttaa 120 
tacactatca aataggaaac attggaaaga 180 
tagaaagttt tcagattact ttgaaaattc 240 
atatgtagat ggactcatgc attaagactg 300 
gtgattttcc ttttaatata catatttatt 360 
actttggaga tatacctatn aaaccaatat 420 
aatgttgctt cagatgtgca agttgcaaat 480 
tttaaactgt atttcaatct atttctccaa 540 
ngataactgc ttctgtgtcg tcgcatttga 600 



sapiens 



sapiens 



WO 02/00174 



21 



PCT/US01/21065 



cgcataactg cacaaatgaa cagtgtatac ctcttggttg tgcattnacc cc ' 652 

^<210> 50 
<211> 650 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_f eature 

<222> 237, 270, 311, 443, 454, 488, 520, 535, 539, 556, 567, 594, 
603, 634 

<223> n = A,T,C or G 
<400> 50 

ttgcgctttg atttttttag ggcttgtgcc ctgtttcact tatagggtct agaatgcttg 60 
tgttgagtaa aaaggagatg cccaatattc aaagctgcta aatgttctct ttgccataaa 120 
gactccgtgt aactgtgtga acacttggga tttttctcct ctgtcccgag gtcgtcgtct 180 
gctttctttt ttgggttctt tctagaagat tgagaaatgc atatgacagg ctgagancac 240 
ctccccaaac acacaagctc tcagccacan gcagcttctc cacagcccca gcttcgcaca 300 
ggctcctgga nggctgcctg ggggaggcag acatgggagt gccaaggtgg ccagatggtt 360 
ccaggactac aatgtcttta tttttaactg tttgccactg ctgccctcac ccctgcccgg 420 
ctctggagta ccgtctgccc canacaagtg ggantgaaat gggggtgggg gggaacactg 480 
attcccantt agggggtgcc taactgaaca gtagggatan aaggtgtgaa cctgngaant 540 
gcttttataa attatnttcc ttgttanatt tattttttaa tttaatctct gttnaactgc 600 
ccngggaaaa ggggaaaaaa aaaaaaaaat tctntttaaa cacatgaaca 650 

<210> 51 

<211> 545 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 66, 159, 195, 205, 214, 243, 278, 298, 306, 337, 366, 375, 
382, 405, 446, 477, 492, 495, 503, 507, 508, 521, 537 
<223> n = A,T,C or G • 

<400> 51 

tggcgtgcaa ccagggtagc tgaagtttgg gtctgggact ggagattggc cattaggcct 60 
cctganattc cagctccctt ccaccaagcc cagtcttgct acgtggcaca gggcaaacct 120 
gactcccttt gggcctcagt ttcccctccc cttcatgana tgaaaagaat actacttttt 180 
cttgttggtc taacnttgct ggacncaaag tgtngtcatt attgttgtat tgggtgatgt 240 
gtncaaaact gcagaagctc actgcctatg agaggaanta agagagatag tggatganag 300 
ggacanaagg agtcattatt tggtatagat ccacccntcc caacctttct ctcctcagtc 360 
cctgcncctc atgtntctgg tntggtgagt cctttgtgcc accanccatc atgctttgca 420 
ttgctgccat cctgggaagg gggtgnatcg tctcacaact tgttgtcatc gtttganatg 480 
catgctttct tnatnaaaca aanaaannaa tgtttgacag ngtttaaaat aaaaaanaaa 540 



<210> 52 
<211> 678 

<212> DNA i 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 98, 119, 121, 131, 136, 139, 140, 142, 143, 163, 168, 172, 
176, 184, 189, 190, 191, 200, 201, 205, 207, 221, 223, 229, 
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230, 237, 240, 241, 255, 264, 266, 267, 276, 280, 288, 289, 
291, 297, 301, 306, 308, 314, 315, 326, 332, 335, 337 
«223> n = A,T,C or G 



<221> mis cofeature 

<222> 339, 341, 343, 344, 345, 347, 350, 355, 356, 358, 362, 363, 
372, 379, 395, 397, 398, 400, 403, 412, 414, 421, 423, 431, 
435, 438, 439, 450, 457, 463, 467, 471, 474, 480, 483, 484, 
487, 490, 491, 492, 493, 499, 500, 504, 508, 518, 536 
f <223> n = A, T,C or G 

<221> misc_feature 

<222> 538, 549, 551, 552, 554, 556, 557, 562, 563, 567, 571, 572, 
576, 579, 590, 592, 595, 598, 606, 609, 613, 620, 622, 624, 
626, 631, 634, 638, 641, 647, 654, 660, 661, 674 
<223> n = A,T,C or G 



<400> 52 

actagtagaa gaactttgcc gcttttgtgc 
ggaggaagac gatttggggg gggagggggg 
ntatctccat ntccantgnn cnntgtcgcc 
tggnccccnn nccctctccn ncctncncct 
ncttccccat ctccntcccc cctnanngtc 
nctccncncc tccnnccgtt cttctnttct 
annctctccc cnctgcaanc gattctctcc 
ncncgctcct nttcntcnnc ccacctctcn 
cgnntcnttn nnntcctcnn accncccncc 
tctctcccnc nncncnncct cnncccntcc 
ccttcntcnc cantccatcn cntntnccat 
ntctctttca cacngtcc 



ctctcacagg cgcctaaagt cattgccatg 60 
gggggcangg tccgtggggG tttccctant 120 
tcttccctcg tcncattnga anttantccc 180 
cccccctccg ncncctccnn ctttttntan 240 
ccaacnccgn cagcaatnnc ncacttnctc 300 
cnacntntnc ncnnntnccn tgccnntnaa 360 
ctccncnnan ctntccactc cntncttctc 420 
ccttcgnccc cantacnctc nccncccttn 480 
tcccttcncc cctcttctcc ccggtntntc 540 
nngcgnccnt ttccgccccn cnccnccntt 600 
nctncctncc nctcacnccc gctncccccn 660 

678 



<210> 53 

<211> 502 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> miscjieature 

<222> 139, 146, 215, 217, 257, 263, 289, 386, 420, 452, 457, 461, 

466, 482, 486 

<223> n = A, T, C or G 

<400> 53 

tgaagatcct ggtgtcgcca tgggccgccg ccccgcccgt tgttaccggt attgtaagaa 60 

caagccgtac ccaaagtctc gcttctgccg aggtgtccct gatgccaaaa ttcgcatttt 120 

tgacctgggg cggaaaaang caaaantgga tgagtctccg ctttgtggcc acatggtgtc 180 

agatcaatat gagcagctgt cctctgaagc cctgnangct gcccgaattt gtgccaataa 240 

gtacatggta aaaagtngtg gcnaagatgc ttccatatcc gggtgcggnt ccaccccttc 300 

cacgtcatcc gcatcaacaa gatgttgtcc tgtgctgggg ctgacaggct cccaacaggc 360 

atgcgaagtg cctttggaaa acccanggca ctgtggccag ggttcacatt gggccaattn 420 

atcatgttca tccgcaccaa ctgcagaaca angaacntgt naattnaagc cctgcccagg 480 

gncaanttca aatttcccgg cc 1 502 



<210> 54 

<211> 4 94 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 
4<222> 431, 442 f 445 
<223> n = A,T,C or G 

<400> 54 

actagtccaa gaaaaatatg cttaatgtat attacaaagg ctttgtatat gttaacctgt 60 
tttaatgcca aaagtttgct ttgtccacaa tttccttaag acctcttcag aaagggattt 120 
gtttgcctta atgaatactg ttgggaaaaa acacagtata atgagtgaaa agggcagaag 180 
caagaaattt ctacatctta gcgactccaa gaagaatgag tatccacatt tagatggcac 240 
attatgagga ctttaatctt tccttaaaca caataatgtt ttcttttttc ttttattcac 300 
atgatttcta agtatatttt tcatgcagga cagtttttca accttgatgt acagtgactg 360 
tgttaaattt ttctttcagt ggcaacctct ataatcttta aaatatggtg agcatcttgt 420 
ctgttttgaa ngggatatga cnatnaatct atcagatggg aaatcctgtt tccaagttag 480 
aaaaaaaaaa aaaa 494 

<210> 55 

<211> 606 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 375, 395, 511, 542, 559, 569, 578, 581 
<223> n = A, T,C or G 

<400> 55 

actagtaaaa agcagcattg ccaaataatc cctaattttc cactaaaaat ataatgaaat 60 
gatgttaagc tttttgaaaa gtttaggtta aacctactgt tgttagatta atgtatttgt 120 
tgcttccctt tatctggaat gtggcattag cttttttatt ttaaccctct ttaattctta 180 
ttcaattcca tgacttaagg ttggagagct aaacactggg atttttggat aacagactga 240 
cagttttgca taattataat cggcattgta catagaaagg atatggctac cttttgttaa 300 
atctgcactt tctaaatatc aaaaaaggga aatgaagtat aaatcaattt ttgtataatc 360 
tgtttgaaac atgantttta tttgcttaat attanggctt tgcccttttc tgttagtctc 420 
ttgggatcct gtgtaaaact gttctcatta aacaccaaac agttaagtcc attctctggt 480 
actagctaca aattccgttt catattctac ntaacaatti aaattaactg aaatatttct 540 
anatggtcta cttctgtcnt ataaaaacna aacttgantt nccaaaaaaa aaaaaaaaaa 600 
aaaaaa 606 

<210> 56 

<211> 183 

<212> DNA 

<213> Homo sapiens 

<400> 56 

actagtatat ttaaacttac aggcttattt gtaatgtaaa ccaccatttt aatgtactgt 60 
aattaacatg gttataatac gtacaatcct tccctcatcc catcacacaa ctttttttgt 120 
gtgtgataaa ctgattttgg tttgcaataa aaccttgaaa aataaaaaaa aaaaaaaaaa 180 
aaa 183 
<210> 57 

<211> 622 * 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 358, 368, 412, 414, 425, 430, 453, 455, 469, 475, 495, 499, 
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529, 540, 564, 575, 590 
<223> n = A,T,C or G 

<400> 57 

actagtcact actgtcttct ccttgtagct aatcaatcaa tattcttccc ttgcctgtgg 60 
gcagtggaga gtgctgctgg gtgtacgctg cacctgccca .ctgagttggg gaaagaggat 120 
aatcagtgag cactgttctg ctcagagctc ctgatctacc ccacccccta ggatccagga 180 
ctgggtcaaa gctgcatgaa accaggccct ggcagcaacc tgggaatggc tggaggtggg 240 
agagaacctg acttctcttt ccctctccct cctccaacat tactggaact ctatcctgtt 300 
agggatcttc tgagcttgtt tccctgctgg gtgggacaga agacaaagga gaagggangg 360 
tctacaanaa gcagcccttc tttgtcctct ggggttaatg agcttgacct ananttcatg 420 
gaganaccan aagcctctga tttttaattt ccntnaaatg tttgaagtnt atatntacat 480 
atatatattt ctttnaatnt ttgagtcttt gatatgtctt aaaatccant ccctctgccn 540 
gaaacctgaa ttaaaaccat gaanaaaaat gtttncctta aagatgttan taattaattg 600 
aaacttgaaa aaaaaaaaaa aa 622 

<210> 58 

<211> 433 

<212> DNA 

<213> Homo sapiens 

<400> 58 

gaacaaattc tgattggtta tgtaccgtca aaagacttga agaaatttca tgattttgca 60 
gtgtggaagc gttgaaaatt gaaagttact gcttttccac ttgctcatat agtaaaggga 120 
tcctttcagc tgccagtgtt gaataatgta tcatccagag tgatgttatc tgtgacagtc 180 
accagcttta agctgaacca ttttatgaat accaaataaa tagacctctt gtactgaaaa 240 
catatttgtg actttaatcg tgctgcttgg atagaaatat ttttactggt tcttctgaat 300 
tgacagtaaa cctgtccatt atgaatggcc tactgttcta ttatttgttt tgacttgaat 360 
ttatccacca aagacttcat ttgtgtatca tcaataaagt tgtatgtttc aactgaaaaa 420 
aaaaaaaaaa aaa 433 

<210> 59 

<211> 649 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 22, 190, 217, 430, 433, 484, 544, 550, 577, 583, 594 
<223> n = A, T,C or G 

<400> 59 

actagttatt atctgacttt cnggttataa tcattctaat gagtgtgaag tagcctctgg 60 

tgtcatttgg atttgcattt ctctgatgag tgatgctatc aagcaccttt gctggtgctg 120 

ttggccatat gtgtatgttc cctggagaag tgtctgtgct gagccttggc ccacttttta 180 

attaggcgtn tgtcttttta ttactgagtt gtaaganttc tttatatatt ctggattcta 240 

gacccttatc agatacatgg tttgcaaata ttttctccca ttctgtgggt tgtgttttca 300 

ctttatcgat aatgtcctta gacatataat aaatttgtat tttaaaagtg acttgatttg 360 

ggctgtgcaa ggtgggctca cgcttgtaat cccagcactt tgggagactg aggtgggtgg 420 

atcatatgan gangctagga gttcgaggtc agcctggcca gcatagcgaa aacttgtctc 480 

tacnaaaaat acaaaaatta gtcaggcatg gtggtgcacg tctgtaatac c^agcttctca 540 

ggangctgan gcacaaggat cacttgaacc ccagaangaa gangttgcag tganctgaag 600 

atcatgccag ggcaacaaaa atgagaactt gtttaaaaaa aaaaaaaaa 64 9 

<210> 60 

<211> 423 

<212> DNA 

<213> Homo sapiens 
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<220> 

-<221> misc_feature 
<222> 209, 222, 277, 389, 398 
<223> n = A,T,C or G 



<400> 60 

actagttcag gccttccagt 
acctggcagt gataccatca 
t gaagtgagcg ctgggctgtt 
tcttctgtat tttttttttc 
tgtcttacaa. ggcagggctt 
aggaatggcc tgagttggcg 
caacccatta atcttttgta 
aaa 

<210> 61 
<211> 423 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 195, 285, 295, 329, 335, 340, 347, 367, 382, 383, 391, 396, 
418 

<223> n = A,T,C or G 
<400> 61 

cgggactgga atgtaaagtg aagttcggag ctctgagcac gggctcttcc cgccgggtcc 60 
tccctcccca gaccccagag ggagaggccc accccgccca gccccgcccc agcccctgct 120 
caggtctgag tatggctggg agtcgggggc cacaggcctc tagctgtgct gctcaagaag 180 
actggatcag ggtanctaca agtggccggg ccttgccttt gggattctac cctgttccta 240 
atttggtgtt ggggtgcggg gtccctggcc cccttttcca cactncctcc ctccngacag 300 
caacctccct tggggcaatt gggcctggnt ctccncccgn tgttgcnacc ctttgttggt 360 
ttaaggnctt taaaaatgtt annttttccc ntgccngggt^ taaaaaagga aaaaactnaa 420 



<210> 62 
<211> 683 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc feature 
<222> 218, 291, 305, 411, 416, 441, 443, 453, 522, 523, 536, 542, 
547, 566, 588, 592, 595, 603, 621, 628, 630, 632, 644, 645, 
648, 655, 660, 672, 674, 676, 677, 683 
<223> n = A,T,C or G 



tcactgacaa acatggggaa 
agcctgatgt ccaaaagagc 
ttagtgccag gctgcggtgg 
cattagtana acacaagact 
tcctacaggg ggtgganaaa 
ttgtgggcag gctactggtt 
gtttgtatna aacttganct 



gtgtgcccag ctggctggaa 60 
aaagaatatt tctccaagca 120 
gcagccatga gaacaaaacc 180 
cngattcagc cgaattgtgg 240 
acagcctttc ttcctttggt 300 
tgtatgatgt attagtagag 360 
gagaccttaa acaaaaaaaa 420 

423 



<400> 62 

gctggagagg ggtacggact ttcttggagt 
gaagagaccc taagagactg gggaatggtt 
gctgtcaaca cttaaaggaa gtccccttga 
tggggccact ggccatggtc cgtggacaag 
ggatcaaaat gtgtacttgt ggggtctcgc 
tgtcnttgga ctttcttccc attccctcct 
ccctcctgtg tttttggaat tctgtttccc 
atgaacttat gtttggggtc nangttcccc 



tgtcccaggt tggaatgaga cfigaactcaa 60 
cctgccttca ggaaagtgaa agacgcttag 120 
agcccagagt ggacagacta gacccattga 180 
acattccngt gggccatggc acaccggggg 240 
cccttgccaa aaccaaacca ntcccactcc 300 
ccccaaatgc acttcccctc ctccctctgc 360 
tcaaaattgt taatttttta nttttngacc 420 
ttnccaatgc atactaatat attaatggtt 480 
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atttattttt gaaatatttt ttaatgaact 
cnttttnttt ggggggggtg gggggntggg 
^ttnttacttg gggcccccct naaaaaantn 
ctaaaaaaaa ananannaaa aan 



tggaaaaaat tnntggaatt tccttncttc 540 
ttaaaatttt tttggaancc cnatnggaaa 600 
anttccaatt cttnnatngc ccctnttccn 660 

683 



<210> 63 

<211> 731 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 237, 249, 263, 288, 312, 317, 323, 326, 337, 352, 362, 370, 
377, 400, 411, 414, 434, 436, 446, 457, 473, 486, 497, 498, 
502, 512, 531, 546, 554, 563, 565, 566, 588, 597, 608, 611, 
613, 615, 627, 632, 640, 641, 644, 654, 660, 663, 665 
<223> n ■> A,T,C or G 



<221> misc_feature 

<222> 671, 678, 692, 697; 698, 699, 704, 705, 712, 714, 717, 718, 
719, 723, 725, 730, 731 
<223> n = A, T, C or G 

<400> 63 

actagtcata aagggtgtgc gcgtcttcga cgtggcggtc ttggcgccac tgctgcgaga 60 
cccggccctg gacctcaagg tcatccactt ggtgcgtgat ccccgcgcgg tggcgagttc 120 
acggatccgc tcgcgccacg gcctcatccg tgagagccta caggtggtgc gcagccgaga 180 
ccgcgagctc accgcatgcc cttcttggag gccgogggcc acaagcttgg cgcccanaaa 240 
gaaggcgtng ggggcccgca aantaccacg ctctgggcgc tatggaangt cctcttgcaa 300 
taatattggt tnaaaanctg canaanagcc cctgcanccc cctgaactgg gntgcagggc 360 
cncttacctn gtttggntgc ggttacaaag aacctgtttn ggaaaaccct nccnaaaacc 420 
ttccgggaaa attntncaaa tttttnttgg ggaattnttg ggtaaacccc ccnaaaatgg 480 
gaaacntttt tgccctnnaa antaaaccat tnggttccgg gggccccccc ncaaaaccct 540 
tttttntttt tttntgcccc cantnncccc ccggggcccc tttttttngg ggaaaanccc 600 
cccccctncc nanantttta aaagggnggg anaatttttn nttncccccc gggncccccn 660 
ggngntaaaa nggtttcncc cccccgaggg gnggggnnnc ctcnnaaacc cntntcnnna 720 
ccncnttttn n 731 



<210> 64 

<211> 313 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc feature 
<222> 240 

<223> n=A,T,Cor G 



<400> 64 

actagttgtg caaaccacga ctgaagaaag 
gttagagatg gttgctacac atgttgggtc 
taaagttgat agagaatatg aagaatgcat 
gattagagat aagtatgaga agaaagctac 
aaatgttgat catgtatata tatccatagt 
aaaaaaaaaa aaa 



acgaaaagtg ggaaataact tgcaacgtct 60 
tgtagagaaa catcttgagg agcagattgc 120 
gtcagaagat ctctcggaaa atattaaaga 180 
tctaattaag tcttctgaag aatgaagatn 240 
gaataaaatt gtctcagtaa agttgtaaaa 300 

313 



<210> 65 
<211> 420 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 400, 402, 403, 404, 405, 406, 409, 411, 412, 414, 415, 416 
<223> n - A,T,C or G 



<400> 65 

t actagttccc tggcaggcaa gggcttccaa 
caggaagctg gcagtggcag cttctgtgtc 
tctgggaggt tggagggaag aatctaggcc 
gtagatactg ccttaacact ccctcctctc 
ctccgtgctc actaatttat ttccaggaaa 
atttgtttta acattttcat tgcaagtatt 
acacaaatta atgatattaa aaagcatcca 



ctgaggcagt gcatgtgtgg cagagagagg 60 
tagggagggg tgtggctccc tccttccctg 120 
ttagcttgcc ctcctgccac ccttcccctt 180 
tcagctgtgg ctgccaccca agccaggttt 240 
ggtgtgtgga agacatgagc cgtgtataat 300 
gaccatcatc cttggttgtg tatcgttgta 360 
aacaaagccn annnnnaana nnannngaaa 420 



<210> 66 
<211> 676 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 328, 454, 505, 555, 586, 612, 636, 641 
<223> n « A,T,C or G 

<400> 66 

actagtttcc tat gat cat t aaactcattc tcagggttaa gaaaggaatg taaatttctg 60 
cctcaatttg tacttcatca ataagttttt gaagagtgca gatttttagt caggtcttaa 120 
aaataaactc acaaatctgg atgcatttct aaattctgca aatgtttcct ggggtgactt 180 
aacaaggaat aatcccacaa tatacctagc tacctaatac atggagctgg ggctcaaccc 240 
actgttttta aggatttgcg cttacttgtg gctgaggaaa aataagtagt tccgagggaa 300 
gtagttttta aatgtgagct tatagatngg aaacagaata, tcaacttaat tatggaaatt 360 
gttagaaacc tgttctcttg ttatctgaat cttgattgca attactattg tactggatag 420 
actccagccc attgcaaagt ctcagatatc ttanctgtgt agttgaattc cttggaaatt 480 
ctttttaaga aaaaattgga gtttnaaaga aataaacccc tttgttaaat gaagcttggc 540 
tttttggtga aaaanaatca tcccgcaggg cttattgttt aaaaanggaa ttttaagcct 600 
ccctggaaaa anttgttaat taaatgggga aaatgntggg naaaaattat ccgttagggt 660 
ttaaagggaa aactta 676 

<210> 67 
<211> 620 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 419, 493, 519, 568, 605, 610 
<223> n - A,T,C or G 

<400> 67 

caccattaaa gctgcttacc aagaacttcc ccagcatttt gacttccttg tttgatagct 60 
gaattgtgag caggtgatag aagagccttt ctagttgaac atacagataa tttgctgaat 120 
acattccatt taatgaaggg gttacatctg ttacgaagct actaagaagg agcaagagca 180 
taggggaaaa aaatctgatc agaacgcatc aaactcacat gtgccccctc tactacaaac 240 
agattgtagt gctgtggtgg tttattccgt tgtgcagaac ttgcaagctg agtcactaaa 300 



WO 02/00174 



PCT/US01/21065 



28 



cccaaagaga ggaaattata ggttagttaa acattgtaat cccaggaact aagtttaatt 360 
cacttttgaa gtgttttgtt ttttattttt ggtttgtctg atttactttg ggggaaaang 420 
btaaaaaaaa agggatatca atctctaatt cagtgcccac taaaagttgt ccctaaaaag 480 
tctttactgg aanttatggg actttttaag ctccaggtnt tttggtcctc caaattaacc 540 
ttgcatgggc cccttaaaat tgttgaangg cattcctgcc tctaagtttg gggaaaattc 600 
ccccnttttn aaaatttgga 620 

<210> 68 
<211> 551 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<222> 286, 464, 480, 501, 502, 518, 528, 533, 536, 537, 538, 539, 
540, 541, 543, 544, 545, 547, 548, 549 
<223> n - A, T,C or G 



<400> 68 

actagtagct ggtacataat cactgaggag 
ctaatgctag accagtattt aagggctaat 
ttagaacaga cctctctgtg caataacttg 
gtattggggt tgcaatgact cccaagggcc 
tctgagactg tggtgaaact ccttccaagg 
actcggcacc actttgatat tcaacaagcc 
tacagctgat ggaactcaat ttgaaccttc 
ttaaacctaa ttacatttgt ctagcattgg 
cctatgtgct cccctccccc nnatcttaat 
nannnannna a 



ctatttctta acatgctttt atagaccatg 60 
ctcacacctc cttagctgta agagtctggc 120 
tggccactgg aaatccctgg gccggcattt 180 
aaaagagtta aaggcacgac tgggatttct 240 
ctgagggggt cagtangtgc tctgggaggg 300 
acttgaagcc caattataaa attgttattt 360 
aaaactttgt tagtttatcc tattatattg 420 
atttggttcc tgtngcatat gtttttttcn 480 
ttaaaccnca attttgcnat tcnccnnnnn 540 

551 



<210> 69 
<211> 396 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> miscJEeature 
<222> 235, 310, 323, 381 
<223> n - A,T,C or G 



<400> 69 

cagaaatgga aagcagagtt ttcatttctg 
gcagagtttt cattaaatcc ttttaccttt 
gtatgtggga tattgaatgt taaagggata 
aattaagcaa atgttaaaag ttttatatgc 
tgtgatacat tttttaagct tcagttgctt 
ggggagccan aaaccaatct acnatctctt 
aaaaataaat aaaaactatt nagaaattga 

<210> 70 
<211> 536 
<212> DNA 

<213> Homo sapiens 



tttataaacg tctccaaaca aaaatggaaa 60 
tttttttctt ggtaatcccc tcaaataaca 120 
tttttttcta ttatttttat aattgtacaa 180 
tttattaatg ttttcaaaag gtatnataca 240 
gtcttctggt actttctgtt atgggctttt 300 
tttgtttgcc aggacatgca ataaaattta 360 
aaaaaa 396 



<220> 

<221> misc_feature 
<222> 388, 446, 455 
<223> n - A,T,C or G 
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<400> 70 

iactagtgcaa aagcaaatat aaacatcgaa aaggcgttcc tcacgttagc tgaagatatc 60 
cttcgaaaga cccctgtaaa agagcccaac agtgaaaatg tagatatcag cagtggagga 120 
ggcgtgacag gctggaagag ca'aatgctgc tgagcattct cctgttccat cagttgccat 180 
ccactacccc gttttctctt cttgctgcaa aataaaccac tctgtccatt tttaactcta 240 
aacagatatt tttgtttctc atcttaacta tccaagccac ctattttatt tgttctttca 300 
tctgtgactg cttgctgact ttatcataat tttcttcaaa caaaaaaatg tatagaaaaa 360 
tcatgtctgt gacttcattt ttaaatgnta cttgctcagc tcaactgcat ttcagttgtt 420 
ttatagtcca gttcttatca acattnaaac ctatngcaat catttcaaat ctattctgca 480 
aattgtataa gaataaaagt tagaatttaa caattaaaaa aaaaaaaaaa aaaaaa 536 

<210> 71 

<211> 865 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 22, 35, 39, 56, 131, 138, 146, 183, 194, 197, 238, 269, 277, 
282, 297, 316, 331, 336, 340, 341, 346, 349, 370, 376, 381, 
382, 392, 396, 397, 401, 433, 444, 445, 454, 455, 469, 472, 
477, 480, 482, 489, 497, 499, 511, 522, 526, 527 
<223> n = A, T, C or G 

. <221> misc_f eature 

<222> 545, 553, 556, 567, 574, 580, 610, 613, 634, 638, 639, 663, 
672, 689, 693, 694, 701, 704, 713, 723, 729, 732, 743, 744, 
749, 761, 765, 767, 769, 772, 774, 780, 783, 788, 792, 803, 
810, 824, 840, 848 
<223> n — A, T,C or G 

<400> 71 

gacaaagcgt taggagaaga anagaggcag ggaanactnc ccaggcacga tggccncctt 60 
cccaccagca accagcgccc cccaccagcc cccaggcccg gacgacgaag actccatcct 120 
ggattaatct nacctctntc gcctgnccca ttcctacctc ggaggtggag gccggaaagg 180 
tcncaccaag aganaanctg ctgccaacac caaccgcccc agccctggcg ggcacganag 240 
gaaactggtg accaatctgc agaattctna gaggaanaag cnaggggccc cgcgctnaga 300 
cagagctgga tatgangcca gaccatggac nctacncccn ncaatncana cgggactgcg 360 
gaagatggan gacccncgac nngatcaggc cngctnncca nccccccacc cctatgaatt 420 
attcccgctg aangaatctc tgannggctt ccannaaagc gcctccccnc cnaacgnaan 480 
tncaacatng ggattanang ctgggaactg naaggggcaa ancctnnaat atccccagaa 540 
acaanctctc ccnaanaaac tggggcncct catnggtggn accaactatt aactaaaccg 600 
cacgccaagn aantataaaa ggggggcccc tccncggnng accccctttt gtcccttaat 660 
ganggttatc cnccttgcgt accatggtnc ccnnttctgt ntgnatgttt ccnctcccct 720 
ccncctatnt cnagccgaac tcnnatttnc ccgggggtgc natcnantng tncncctttn 780 
ttngttgncc cngccctttc cgncggaacn cgtttccccg ttantaacgg cacccggggn 840 
aagggtgntt ggccccctcc ctccc 865 

<210> 72 
<211> 560 

<212> DNA ■ 
<213> Homo sapiens 

<220> 

<221> mi sc_f eature 

<222> 83, 173, 183, 186, 209, 211, 215, 255, 321, 322, 323, 335, 
344, 357, 361, 368, 394, 412, 415, 442, 455, 469, 472, 475, 



WO 02/00174 



PCT/US01/21065 



30 



487, 513, 522, 528, 531, 534, 546 
<223> n = A, T, C or G 

<400> 72 

cctggacttg tcttggttcc agaacctgac gacccggcga cggcgacgtc tcttttgact 60 
aaaagacagt gtccagtgct ccngcctagg agtctacggg gaccgcctcc cgcgccgcca 120 
ccatgcccaa cttctctggc aactggaaaa tcatccgatc ggaaaacttc gangaattgc 180 
tcnaantgct gggggtgaat gtgatgctna ngaanattgc tgtggctgca gcgtccaagc 240 
cagcagtgga gatcnaacag gagggagaca ctttctacat caaaacctcc accaccgtgc 300 
gcaccacaaa gattaacttc nnngttgggg aggantttga ggancaaact gtggatngga 360 
ngcctgtnaa aacctggtga aatgggagaa tganaataaa atggtctgtg ancanaaact 420 
cctgaaagga gaaggccccc anaactcctg gaccngaaaa actgacccnc cnatngggga 480 
actgatnctt gaaccctgaa cgggcgggat ganccttttt tnttgccncc naangggttc 540 
tttccntttc cccaaaaaaa 560 

<210> 73 

<211> 379 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc feature 

<222> 8, 17, 18, 21, 26, 29, 30, 32, 53, 56, 67, 71, 81, 102, 104, 
111, 112, 114, 119, 122, 124, 125, 134, 144, 146, 189, 190, 
214, 215, 219, 220, 235, 237, 246, 280, 288, 302, 310 r 313, 
319, 322, 343, 353, 354 
<223> n = A, T, C or G 



<400> 73 

•ctggggancc ggcggtnngc nccatntcnn 
aaccgcncaa naaacatgcc naagatatgg 
gnanngagga acanaacaaa ctcnangagc 
ttggccacnn gtggaattaa gaaatctggc 
ataagngacc ctttatttca tctgtattta 
tnccacgtan agntggaant anttgttgtc 
ttgttcaaaa aaaaaataa 

<210> 74 

<211> 437 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> miscjEeature 
<222> 145, 355 
<223> n = A, T,C or G 



gncgcgaagg tggcaataaa aanccnctga 60 
acgaggaaga tngngctttc nngnacaanc 120 
tctcaagcta atgccgcggg gaaggggccc 180 
aaanngtann tgttccttgt gcctnangag 240 
aacctctctn ttccctgnca taacttcttt 300 
ttggactgtt gtncatttta gannaaactt 360 

379 



<400> 74 

actagttcag actgccacgc caaccccaga 
ctaggtgttt ccatctatgt ttcaatctgt 
acaaaaaaac gctgccaggt tttanaagca 
caccagggtt cttttgaaat agtaccacat 
aatcactgaa ttgtcaggct ttgattgata 
gaataagtta taatcagtat tcatctcttt 
gtcatttgta ctgtttgaaa aatatttctt 
aaaaaaaaaa aaaaaaa 



aaatacccca catgccagaa aagtgaagtc 60 
ccatctacca ggcctcgcga taaaaacaaa 120 
gttctggtct caaaaccatc aggatcctgc 180 
gtaaaaggga atttggcttt cacttcatct 240 
attgtagaaa taagtagcct tctgttgtgg 300 
gttttttgtc actcttttct ctctnattgt 360 
ctataaaatt aaactaacct gccttaaaaa 420 

437 



<210> 75 
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. <211> 579 / 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 
<222> 440, 513, 539, 551 
<223> n = A,T,C or G 

<400> 75 

ctccgtcgcc gccaagatga tgtgcggggc gccctccgcc acgcagccgg ccaccgccga 60 
gacccagcac atcgccgacc aggtgaggtc ccagcttgaa gagaaagaaa acaagaagtt 120 
ccctgtgttt aaggccgtgt cattcaagag ccaggtggtc gcggggacaa actacttcat 180 
caaggtgcac gtcggcgacg aggacttcgt acacctgcga gtgttccaat ctctccctca 240 
tgaaaacaag cccttgacct tatctaacta ccagaccaac aaagccaagc atgatgagct 300 
gacctatttc tgatcctgac tttggacaag gcccttcagc cagaagactg acaaagtcat 360 
cctccgtcta ccagagcgtg cacttgtgat cctaaaataa gcttcatctc cgggctgtgc 420 
ccttggggtg gaaggggcan gatctgcact gcttttgcat ttctcttcct aaatttcatt 480 
gtgttgattc tttccttcca ataggtgatc ttnattactt tcagaatatt ttccaaatna 540 
gatatatttt naaaatcctt aaaaaaaaaa aaaaaaaaa 579 

<210> 76 

<211> 666 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 411, 470, 476, 491, 506, 527, 560, 570, 632, 636, 643, 650, 
654, 658 

<223> n — A, T, C or G 
<400> 7 6 

gtttatccta tctctccaac cagattgtca gctccttgag ggcaagagcc acagtatatt 60 
tccctgtttc ttccacagtg cctaataata ctgtggaact aggttttaat aattttttaa 120 
ttgatgttgt tatgggcagg atggcaacca gaccattgtc tcagagcagg tgctggctct 180 
ttcctggcta ctccatgttg gctagcctct ggtaacctct tact tat tat cttcaggaca 240 
ctcactacag ggaccaggga tgatgcaaca tccttgtctt tttatgacag gatgtttgct 300 
cagcttctcc aacaataaaa agcacgtggt aaaacacttg cggatattct ggactgtttt 360 
taaaaaatat acagtttacc gaaaatcata ttatcttaca atgaaaagga ntttatagat 420 
cagccagtga acaacctttt cccaccatac aaaaattcct tttcccgaan gaaaanggct 480 
ttctcaataa ncctcacttt cttaanatct tacaagatag ccccganatc ttatcgaaac 540 
tcattttagg caaatatgan ttttattgtn cgttacttgt ttcaaaattt ggtattgtga 600 
atatcaatta ccacccccat ctcccatgaa anaaanggga aanggtgaan ttcntaancg 660 
Cttaaa 666 

<210> 77 

<211> 396 

<212> DNA 

<213> Homo sapiens 

<220> f 
<221> mis cofeature 

<222> 31, 54, 125, 128, 136, 163, 168, 198 
<223> n = A,T,C or G 

<400> 77 

ctgcagcccg ggggatccac taatctacca nggttatttg gcagctaatt ctanatttgg 60 
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atcattgccc aaagttgcac ttgctggtct 
catanganta tgccanaata aattccattt 
^ggtccacag cataacangc actgcctcct 
attaagtgag aagggagact ctcagccttc 
gaagtttttt aaacctctga atttgtacac 
aatacttcta atgggaacaa aaaaaaaaaa 

<210> 78 

<211> 793 

<212> DNA 

<213> Homo sapiens 



cttgggattt ggccttggaa aggtatcata 120 
ttttgaaaat canctccntg gggctggttt 180 
tacctgtgag gaatgcaaaa taaagcatgg 240 
agcttcctaa attctgtgtc tgtgactttc 300 
atttaaaatt tcaagtgtac tttaaaataa 360 
aaaaaa 396 



<220> 

<221> raisc_feature 

<222> 309, 492, 563, 657 f 660, 703, 708, 710, 711, 732, 740, 748, 
758, 762, 765, 787 
<223> n « A,T,C or G 



<400> 78 

gcatcctagc cgccgactca cacaaggcag 
gaaaattcca gtgtcagcat tcttgctcct 
taccacagtc aaacctggag ccaaaaagga 
gaccctctcc agaggttggg gtgaccaact 
atataaatcc aagacaagca acaaaccctt 
acacagtcna gctttaaaga aagtgtttgc 
gcagtttgtc ctcctcaatc tggtttatga 
ccagtatgtc ccaggattat gtttgttgac 
ggaagatatt cnaaccgtct ctatgcttac 
atgaaaaagc tctcaagttg ctnaaaatga 
tctgtcggct tgaaaattga aaccagaaaa 
gacacctgat taggttttgg ttatgttcac 
ttggttcaat tntctttttn aaacaatntg 
aataatnttt ggc 

<210> 79 

<211> 456 

<212> DNA 

<213> Homo sapiens 



gtgggtgagg aaatccagag ttgccatgga 60 
tgtggccctc tcctacactc tggccagaga 120 
cacaaaggac tctcgaccca aactgcccca 180 
catctggact cagacatatg aagaagctct 240 
gatgattatt catcacttgg atgagtgccc 300 
tgaaaataaa gaaatccaga aattggcaga 360 
aacaactgac aaacaccttt ctcctgatgg 420 
ccatctctga cagttgaagc cgatatcctg 480 
aaactgcaga tacgctctgt tgcttgacac 540 
attgtaagaa aaaaaatctc cagccttctg 600 
atgtgaaaaa tggctattgt ggaacanatn 660 
cactattttt aanaaaanan nttttaaaat 720 
tttctacntt gnganctgat ttctaaaaaa 780 

793 



<220> 

<221> mis cofeature 

<222> 89, 195, 255, 263, 266, 286, 353, 384, 423, 425, 436, 441 
<223> n = A, T,C or G 



<400> 79 

actagtatgg ggtgggaggc cccacccttc 
ctccgtggag agggactggc agagctgang 
gcagctgttg agcgcaccta accactggtc 
tcctcccgac cccangacca ggctacttct 
tgcctctgat cgtangaatt gangantgtc 
ggggctggaa atgggtgtgt gtgtgtgtgt 
tgcaagaccg agattgaggg aaancatgtc 
aantncccct gtgacnctca naaaaaaaaa 



tcccctaggc gctgttcttg ctccaaaggg 60 
ccacctgggg ctggggatcc cactcttctt 120 
atgcccccac ccctgctctc cgcacccgct 180 
cccctcctct tgcctccctc ctgcccctgc 240 
ccgccttgtg gctganaatg gacagtggca 300 
gtgtgtgtgt gtgtgtgtgt gcnccccccc 360 
tgctgggtgt gaccatgttt cctctccata 420 
aaaaaa 456 



<210> 80 

<211> 284 

<212> DNA 

<213> Homo sapiens 
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<220> 

<<221> misc_feature 
<222> 283 

<223> n » A,T,C or G 
<400> 80 

ctttgtacct ctagaaaaga taggtattgt gtcatgaaac ttgagtttaa attttatata 60 
taaaactaaa agtaatgctc actttagcaa cacatactaa aattggaacc atactgagaa 120 
gaatagcatg acctccgtgc aaacaggaca agcaaatttg tgatgtgttg attaaaaaga 180 
aataaataaa tgtgtatatg tgtaacttgt atgtttatgt ggaatacaga ttgggaaata 240 
aaatgtattt cttactgtga aaaaaaaaaa aaaaaaaaaa aana 284 

<210> 81 

<211> 671 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 388, 505, 600, 603, 615, 642, 644, 660 
<223> n = A, T,C or G 



<400> 81 

gccaccaaca ttccaagcta ccctgggtac 
agcaagcggt gtgcacacgg agactcatcg 
gaaaggctgg ggatatttgg gttggcttgg 
tactaaaaca gtattatctt ttgaatatcg 
tcaagatggc tagaatggtg cctttctgag 
ttcaacacac ttccactgcc tgcgtaatga 
tttcaatgcc gtcattttca gttagatnat 
atttgattag tcttattttt ttatttttac 
tgtgacaaag tcaaataaac ccccnaggac 
acattaagct ttggccaaaa aatgttgcat 
canaaaggct ggctnataat gttggtggtg 
aaaaaaaaaa a 

<210> 82 

<211> 217 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 35 

<223> n = A, T, C or G 



ctttgtgcag tagaagctag tgagcatgtg 60 
ttataattta ctatctgcca agagtagaaa 120 
ttttgatttt ttgcttgttt gtttgttttg 180 
tagggacata agtatataca tgttatccaa 240 
tgtctaaaac ttgacacccc tggtaaatct 300 
agttttgatt catttttaac cactggaatt 360 
tttgcacttt gagattaaaa tgccatgtct 420 
aggcttatca gtctcactgt tggctgtcat 480 
aacacacagt atgggatcac atattgtttg 540 
gtgttttacc tcgacttgct aaatcaatan 600 
aaataattaa tnantaacca aaaaaaaaan 660 

671 



<400> 82 

ctgcagatgt ttcttgaatg ctttgtcaaa 
agacaataag tggtggtgta tcttgtttct 
tcttattagg gagttgtatg tcagtgtata 
aaattcttta aaaggaaaaa aaaaaaaaaa 

<210> 83 

<211> 460 

<212> DNA 

<213> Homo sapiens 



ttaanaaagt taaagtgcaa taatgtttga 60 
aataagataa acttttttgt ctttgcttta 120 
aaacatactg tgtggtataa caggcttaat 180 
aaaaaaa 1 217 
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<220> 

<221> misc_feature 

«222> 104, 118, 172, 401, 422, 423, 444, 449 
<223> n = A,T,C or G 

<400> 83 

cgcgagtggg agcaccagga tctcgggctc ggaacgagac tgcacggatt gttttaagaa 60 

aatggcagac aaaccagaca tgggggaaat cgccagcttc gatnaggcca agctgaanaa 120 

aacggagacg caggagaaga acaccctgcc gaccaaagag accattgagc angagaagcg 180 

t gagtgaaatt tcctaagatc ctggaggatt tcctaccccc gtcctcttcg agaccccagt 240 

cgtgatgtgg aggaagagcc acctgcaaga tggacacgag ccacaagctg cactgtgaac 300 

ctgggcactc cgcgccgatg ccaccggcct gtgggtctct gaagggaccc cccccaatcg 360 

gactgccaaa ttctccggtt tgccccggga tattatacaa nattatttgt atgaataatg 420 

annataaaac acacctcgtg gcancaaana aaaaaaaaaa 460 

<210> 84 

<211> 323 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 70, 138, 178, 197, 228, 242, 244, 287, 311 
<223> n « A,T,C or G 

<400> 84 

tggtggatct tggctctgtg gagctgctgg gacgggatct aaaagactat tctggaagct 60 
gtggtccaan gcattttgct ggcttaacgg gtcccggaac aaaggacacc agctctctaa 120 
aattgaagtt tacccganat aacaatcttt tgggcagaga tgcctatttt aacaaacncc 180 
gtccctgcgc aacaacnaac aatctctggg aaataccggc catgaacntg ctgtctcaat 240 
cnancatctc tctagctgac cgatcatatc gtcccagatt actacanatc ataataattg 300 
atttcctgta naaaaaaaaa aaa 323 

<210> 85 

<211> 771 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 63, 426, 471, 497, 521, 554, 583, 586, 606, 609, 615, 652, 
686, 691, 694, 695, 706, 713, 730, 732, 743, 751 
<223> n - A,T,C or G 

<400> 85 

aaactgggta ctcaacactg agcagatctg ttctttgagc taaaaaccat gtgctgtacc 60 
aanagtttgc tcctggctgc tttgatgtca gtgctgctac tccacctctg cggcgaatca 120 
gaagcaagca actttgactg ctgtcttgga tacacagacc gtattcttca tcctaaattt 180 
attgtgggct tcacacggca gctggccaat gaaggctgtg acatcaatgc tatcatcttt 240 
cacacaaaga aaaagttgtc tgtgtgcgca aatccaaaac agacttgggt gaaatatatt 300 
gtgcgtctcc tcagtaaaaa agtcaagaac atgtaaaaac tgtggctttt ctggaatgga 360 
attggacata gcccaagaac agaaagaact tgctggggtt ggaggtttca cttgcacatc 420 
atgganggtt tagtgcttat cttatttgtg cctcctggac ttgtccaatt natgaagtta 480 
atcatattgc atcatanttt gctttgttta acatcacatt naaattaaac tgtattttat 540 
gttatttata gctntaggtt ttctgtgttt aactttttat acnaantttc ctaaactatt 600 
ttggtntant gcaanttaaa aattatattt ggggggggaa taaatattgg antttctgca 660 
gccacaagct ttttttaaaa aaccantaca nccnngttaa atggtnggtc ccnaatggtt 720 
tttgcttttn antagaaaat ttnttagaac natttgaaaa aaaaaaaaaa a 771 
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<210> 86 
'<211> 628 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> miscjeature 

<222> 162, 249, 266, 348, 407, 427, 488, 518, 545, 566, 569, 597, 
598, 611, 617, 621, 624 
<223> n = A,T,C or G 

<400> 86 

actagtttgc tttacatttt tgaaaagtat tatttttgtc caagtgctta tcaactaaac 60 
cttgtgttag gtaagaatgg aatttattaa gtgaatcagt gtgacccttc ttgtcataag 120 
attatcttaa agctgaagcc aaaatatgct tcaaaagaaa angactttat tgttcattgt 180 
agttcataca ttcaaagcat ctgaactgta. gtttctatag caagccaatt acatccataa 240 
gtggagaang aaatagatta atgtcnaagt atgattggtg gagggagcaa ggttgaagat 300 
aatctggggt tgaaattttc tagttttcat tctgtacatt tttagttnga catcagattt 360 
gaaatattaa tgtttacctt tcaatgtgtg gtatcagctg gactcantaa cacccctttc 420 
ttccctnggg gatggggaat ggattattgg aaaatggaaa gaaaaaagta cttaaagcct 480 
tcctttcnca gtttctggct cctaccctac tgatttancc agaataagaa aacattttat 540 
catcntctgc tttattccca ttaatnaant tttgatgaat aaatctgctt ttatgcnnac 600 
ccaaggaatt nagtggnttc ntcnttgt 528 

<210> 87 

<211> 518 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 384, 421, 486 
<223> n = A,T,C or G 

<400> 87 

ttttttattt tttttagaga gtagttcagc ttttatttat aaatttattg cctgttttat 60 
tataacaaca ttatactgtt tatggtttaa tacatatggt tcaaaatgta taatacatca 120 
agtagtacag ttttaaaatt ttatgcttaa aacaagtttt gtgtaaaaaa tgcagataca 180 
ttttacatgg caaatcaatt tttaagtcat cctaaaaatt gatttttttt tgaaatttaa 240 
aaacacattt aatttcaatt tctctcttat ataaccttta ttactatagc atggtttcca 300 
ctacagttta acaatgcagc aaaattccca tttcacggta aattgggttt taagcggcaa 360 
ggttaaaatg ctttgaggat cctnaatacc ctttgaactt caaatgaagg ttatggttgt 420 
naatttaacc ctcatgccat aagcagaagc acaagtttag ctgcattttg ctctaaactg 480 
taaaancgag ccccccgttg aaaaagcaaa agggaccc 518 

<210> 88 

<211> 1844 

<212> DNA 

<213> Homo sapiens 

<400> 88 ' 
gagacagtga atcctagtat caaaggattt ttggcctcag aaaaagttgt tgattatttt 60 
tattttattt tatttttcga gactccgtct caaaaaaaaa aaaaaaaaaa agaatcacaa 120 
ggtatttgct aaagcatttt gagctgcttg gaaaaaggga agtagttgca gtagagtttc 180 
ttccatcttc ttggtgctgg gaagccatat atgtgtcttt tactcaagct aaggggtata 240 
agcttatgtg ttgaatttgc tacatctata tttcacatat tctcacaata agagaatttt 300 
gaaatagaaa tatcatagaa catttaagaa agtttagtat aaataatatt ttgtgtgttt 360 
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taatcccttt gaagggatct atccaaagaa aatattttac actgagctcc ttcctacacg 420 
tctcagtaac agatcctgtg ttagtctttg aaaatagctc attttttaaa tgtcagtgag 480 
%agatgtagc atacatatga tgtataatga cgtgtattat gttaacaatg tctgcagatt 540 
ttgtaggaat acaaaacatg gcctttttta taagcaaaac gggccaatga ctagaataac 600 
acatagggca atctgtgaat atgtattata agcagcattc cagaaaagta gttggtgaaa 660 
taattttcaa gtcaaaaagg gatatggaaa gggaattatg agtaacctct attttttaag 720 
ccttgctttt aaattaaacg ctacagccat ttaagccttg aggataataa agcttgagag 780 
taataatgtt aggttagcaa aggtttagat gtatcacttc atgcatgcta ccatgatagt 840 
aatgcagctc ttcgagtcat ttctggtcat tcaagatatt cacccttttg cccatagaaa 900 
t gcaccctacc tcacctgctt actgacattg tcttagctga tcacaagatc attatcagcc 960 
tccattattc cttactgtat ataaaataca gagttttata ttttcctttc ttcgtttttc 1020 
accatattca aaacctaaat ttgtttttgc agatggaatg caaagtaatc aagtgttcgt 1080 
gctttcacct agaagggtgt ggtcctgaag gaaagaggtc cctaaatatc ccccaccctg 1140 
ggtgctcctc cttccctggt accctgacta ccagaagtca ggtgctagag cagctggaga 1200 
agtgcagcag cctgtgcttc cacagatggg ggtgctgctg caacaaggct ttcaatgtgc 1260 
ccatcttagg gggagaagct agatcctgtg cagcagcctg gtaagtcctg aggaggttcc 1320 
attgctcttc ctgctgctgt cctttgcttc tcaacggggc tcgctctaca gtctagagca 1380 
catgcagcta acttgtgcct ctgcttatgc atgagggtta aattaacaac cataaccttc 1440- 
atttgaagtt caaaggtgta ttcaggatcc tcaaagcatt ttaaccttgc cgcttaaaac 1500 
ccaatttacc gtgaaatggg aattttgctg cattgttaaa ctgtagtgga aaccatgcta 1560 
tagtaataaa ggttatataa gagagaaatt gaaattaaat gtgtttttaa atttcaaaaa 1620 
aaaatcaatc tttaggatga cttaaaaatt gatttgccat gtaaaatgta tctgcatttt 1680 
ttacacaaaa cttgttttaa gcataaaatt ttaaaactgt actacttgat gtattataca 1740 
ttttgaacca tatgtattaa accataaaca gtataatgtt gttataataa aacaggcaat 1800 
aaatttataa ataaaagctg aaaaaaaaaa aaaaaaaaaa aaaa 1844 

<210> 89 

<211> 523 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 288, 352, 369, 398, 475, 511, 513 
<223> n = A,T,C or G 

<400> 89 

tttttttttt tttttttagt caatccacat ttattgatca cttattatgt accaggcact 60 
gggataaaga tgactgttag tcactcacag taaggaagaa aactagcaaa taagacgatt 120 
acaatatgat gtagaaaatg ctaagccaga gatatagaaa ggtcctattg ggtccttctg 180 
tcaccttgtc tttccacatc cctacccttc acaggccttc cctccagctt cctgcccccg 240 
ctccccactg cagatcccct gggattttgc ctagagctaa acgagganat gggccccctg 300 
gccctggcat gacttgaacc caaccacaga ctgggaaagg gagcctttcg anagtggatc 360 
actttgatna gaaaacacat agggaattga agagaaantc cccaaatggc cacccgtgct 420 
ggtgctcaag aaaagtttgc agaatggata aatgaaggat caagggaatt aatanatgaa 480 
taattgaatg gtggctcaat aagaatgact ncnttgaatg acc 523 

<210> 90 

<211> 604 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 563 

<223> n = A,T,C or G 



<400> 90 
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ccagtgtggt ggaatgcaaa 
gcaaaggaaa tagccaatat 
^ctcacccacc aactaaatcc 
gggagccttc aagggcatgt 
ctctttcctc tctgatcctt 
ctggaatgca attgtttgca 
cactgcattg gtgataggac 
aggggtggta cacctgtttg 
accactaatg gggagggcag 
/ ccctaattgc tgaaattccc 
cccc 

<210> 91 
<211> 858 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis c__f eat lire 
<222> 570, 591, 655, 664, 667, 683, 711, 759, 760, 765, 777, 787, 
792, 794, 801, 804, 809, 817, 820 
<223> n = A,T,C or G 

<400> 91 

tttttttttt ttttttttta tgattattat tttttttatt gatctttaca tcctcagtgt 60 
tggcagagtt tctgatgctt aataaacatt tgttctgatc agataagtgg aaaaaattgt 120 
catttcctta ttcaagccat gcttttctgt gatattctga tcctagttga acatacagaa 180 
ataaatgtct aaaacagcac ctcgattctc gtctataaca ggactaagtt cactgtgatc 240 
ttaaataagc ttggctaaaa tgggacatga gtggaggtag tcacacttca gcgaagaaag 300 
agaatctcct gtataatctc accaggagat tcaacgaatt ccaccacact ggactagtgg 360 
atcccccggg ctgcaggaat tcgatatcaa gcttatcgat accgtcgacc tcgagggggg 420 
gcccggtacc caattcgccc tatagtgagt cgtattacgc gcgctcactg gccgtcgttt 480 
tacaacgtcg tgactgggaa aaccctggcg ttacccaact taatcgcctt gcagcacatc 540 
cccctttcgc cagctggcgt aatagcgaan agcccgcacc gatcgccctt ncaacagttg 600 
cgcagcctga atggcgaatg ggacgcgccc tgtagcggcg cattaaagcg cggcngggtg 660 
tggnggntcc cccacgtgac cgntacactt ggcagcgcct tacgccggtc nttcgctttc 720 
ttcccttcct ttctcgcacc gttcgccggg tttccccgnn agctnttaat cgggggnctc 780 
cctttanggg tncnaattaa nggnttacng gaccttngan cccaaaaact ttgattaggg 840 
ggaaggtccc cgaagggg 858 

<210> 92 
<211> 585 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> miscfeature 

<222> 317, 319, 320, 321, 325, 327, 328, 330, 331, 332, 460, 462, 
483, 485, 487, 523, 538, 566, 584 
<223> n = A, T, C or G 

<400> 92 ! 
gttgaatctc ctggtgagat tatacaggag attctctttc ttcgc.tgaag tgtgactacc 60 
tccactcatg tcccatttta gccaagctta tttaagatca cagtgaactt agtcctgtta 120 
tagacgagaa tcgaggtgct gttttagaca tttatttctg tatgttcaac taggatcaga 180 
atatcacaga aaagcatggc ttgaataagg aaatgacaat tttttccact tatctgatca 240 
gaacaaatgt ttattaagca tcagaaactc tgccaacact gaggatgtaa agatcaataa 300 
aaaaaataat aatcatnann naaanannan nngaagggcg gccgccaccg cggtggagct 360 



gattaccccg gaagctttcg 
gtgtcgtttc tatgaaatga 
caaagtcaaa agcttcagcc 
agaaaatcag ctgttcagat 
ttcctcttta cggcacaaca 
acaccgaagg atttcctgcg 
acggtaattt gattcacatt 
gtaaaatgag aagcctcgga 
attattactg ggatttctcc 
ctnggcaggc tccagttttc 



agaagctggg attccctgca 60 
agccagaccg agatgtcaat 120 
agtttatctc agagaaccag 180 
aggcctctgc accacacagc 240 
ttcatgtttg acagaacatg 300 
gtcgcctctt cagtaggaag 360 
taacttgcta gttagtgata 420 
aacttgggag cttctctcct 480 
tggggtgaat taatttcaag 540 
tcaactgcat tgcaaaattc 600 

604 
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ccagcttttg ttccctttag tgagggttaa ttgcgcgctt ggcgttaatc atggtcatag 420 

ctgtttcctg tgtgaaattg ttatccggct cacaattccn cncaacatac gagccgggaa 480 

igcntnangtg taaaagcctg ggggtgccta attgagtgag ctnactcaca ttaattgngt 540 

tgcgctccac ttgcccgctt ttccantccg ggaaacctgt tcgnc 585 

<210> 93 

<211> 567 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 82, 158, 230, 232, 253, 266, 267, 268, 269, 270, 271, 272, 
273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 
285, 286, 287, 295, 303, 307, 314, 349, 352, 354, 356, 366, 
369, 379, 382, 386, 393, 404, 427, 428, 446, 450, 452 
<223> n = A, T, C or G 

<221> misc_feature 

<222> 453, 454, 459, 462, 480, 481, 483, 488, 493, 501, 509, 511, 
512, 518, 520, 525, 526, 532, 541, 557 
<223> n = A,T,C or G 



<400> 93 

cggcagtgtt gctgtctgcg tgtccacctt 
agactgcggc tggggtgggc anggaaggga 
ttccctgtac ccaccttccc cttgcttcat 
ccagtttcct tgtgtgatac actaatgtat 
attaaattgc tantgtttct ttgaannnnn 
ccncggngga aacnccccct tttgttccct 
gttaanccnt gggccaaanc tngttncccg 
ccccccnncc ttccaaaccc ggaaancctn 
ngnaattnaa ccnaaccccc ntttaaatng 
nttcggggaa aaccctntcc gtgccca 

<210> 94 
<211> 620 
<212> DNA 

<213> Homo sapiens 



ggaatctggc tgaactggct gggaggacca 60 
accgggggct gctgtgaagg atcttggaac 120 
gtttgtanag gaaccttgtg ccggccaagc 180 
ttgctttttt tgggaaatan anaaaaatca 240 
nnnnnnnnnn nnnnnnnggg ggggncgccc 300 
ttaattgaaa ggttaattng cncncntggc 360 
tgntgaaatt gttnatcccc tcccaaattc 420 
annntgttna ancccggggg gttgcctaan 480 
nntttgcncn ccacnngccc cnctttccca 540 

567 



<220> 

<221> misc_feature 

<222> 169, 171, 222, 472, 528, 559, 
<223> n = A,T,C or G 



599 



<400> 94 

actagtcaaa 

catgtttatc 

gccaatattt 

gtgaaactta 

gttcttgtta 

ataaggttaa 

tttcaagcct 

gagaatttct 

atgtctctaa 

tttcccttaa 

agggttaagg 



aatgctaaaa 
ttttattatg 
ccttatatct 
acactttata 
tttccaaata 
aagttgttaa 
tcgaactatt 
cattaatatc 
gaaagtacta 
gtgtgaaant 
gtgttgggga 



taatttggga 
ttttgtgaag 
atccataaca 
aggtaaaaat 
gaatggactt 
tgaccaaaca 
taaggaaagc 
ctgaatcatt 
tttcatggtc 
atttaaaatg 



gaaaatattt 
ttgtgtcttt 
tttatactac 
gaggtttcca 
ggtctgttaa 
ttctaaaaga 
aaaatcattt 
catttcacta 
caaacctggt 
aaattttcct 



tttaagtagt 
tcactaatta 
atttgtaana 
anatttaata 
gggctaagga 
aatgcaaaaa 
cctaaatgca 
aggctcatgt 
tgccatantt 
ctttttaaaa 



gttatagttt 60 
cctatactat 120 
naatatgcac 180 
atctgatcaa 240 
gaagaggaag 300 
aaaagtttat 360 
tatcatttgt 420 
tnactccgat 480 
gggtaaaggc 540 
attctttana 600 
620 
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<210> 95 

<211> 470 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 61, 67, 79, 89, 106, 213, 271, 281, 330, 354, 387, 432, 448 
<223> n - A,T,C or G 

<400> 95 

ctcgaccttc tctgcacagc ggatgaaccc tgagcagctg aagaccagaa aagccactat 60 
nactttntgc ttaattcang agcttacang attcttcaaa gagtgngtcc agcatccttt 120 
gaaacatgag ttcttaccag cagaagcaga cctttacccc accacctcag cttcaacagc 180 
agcaggtgaa acaacccatc cagcctccac ctnaggaaat atttgttccc acaaccaagg 240 
agccatgcca ctcaaaggtt ccacaacctg naaacacaaa nattccagag ccaggctgta 300 
ccaaggtccc tgagccaggg ctgtaccaan gtccctgagc caggttgtac caangtccct 360 
gagccaggat gtaccaaggt ccctgancca ggttgtccaa ggtccctgag ccaggctaca 420 
ccaagggcct gngccaggca gcatcaangt ccctgaccaa ggcttatcaa 470 

<210> 96 

<211> 660 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__feature 

<222> 299, 311, 360, 426, 538, 540, 542, 553, 563, 565, 592, 603, 
604, 618, 633, 647, 649, 651, 653 
<223> n = A,T,C or G 

<400> 96 

tttttttttt tttttttttt ggaattaaaa gcaatttaat gagggcagag caggaaacat 60 
gcatttcttt tcattcgaat cttcagatga accctgagca gccgaagacc agaaaagcca 120 
tgaagacttt ctgcttaatt caggggctta caggattctt cagagtgtgt gtgaacaaaa 180 
gctttatagt acgtattttt aggatacaaa taagagagag actatggctt ggggtgagaa 240 
tgtactgatt acaaggtcta cagacaatta agacacagaa acagatggga agagggtgnc 300 
cagcatctgg nggttggctt ctcaagggct tgtctgtgca ccaaattact tctgcttggn 360 
cttctgctga gctgggcctg gagtgaccgt tgaaggacat ggctctggta cctttgtgta 420 
gcctgncaca ggaactttgg tgtatccttg ctcaggaact ttgatggcac ctggctcagg 480 
aaacttgatg aagccttggt caagggacct tgatgcttgc tggctcaggg accttggngn 540 
ancctgggct canggacctt tgncncaacc ttggcttcaa gggacccttg gnacatcctg 600 
gcnnagggac ccttgggncc aaccctgggc ttnagggacc ctttggntnc nanccttggc 660 



<210> 97 

<211> 441 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 12, 308 

<223> n = A,T,C or G 



<400> 97 

gggaccatac anagtattcc tctcttcaca ccaggaccag ccactgttgc agcatgagtt 60 
cccagcagca gaagcagccc tgcatcccac cccctcagct tcagcagcag caggtgaaac 120 
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agccttgcca gcctccacct caggaaccat 
ccaaggtgcc tgagccctgc caccccaaag 
agccatgcca ccccaaggtg cctgagccct 
agcagaanac caagcagaag taatgtggtc 
agatgctgaa tcccctatcc cattctgtgt 
ctgtctcccc caaaaaaaaa a 



gcatccccaa aaccaaggag ccctgccacc 180 
tgcctgagcc ctgccagccc aaggttccag 240 
gcccttcaat agtcactcca gcaccagccc 300 
cacagccatg cccttgagga gccggccacc 360 
atgagtccca tttgccttgc aattagcatt 420 

441 



<210> 98 
<211> 600 
/ <212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 295, 349, 489, 496, 583 

<223> n = A,T,C or G 

<400> 98 

gtattcctct cttcacacca ggaccagcca 
gcagccctgc atcccacccc ctcagcttca 
tccacctcag gaaccatgca tccccaaaac 
gccctgccac cccaaagtgc ctgagccctg 
caaggtgcct gagccctgcc cttcaatagt 
gcagaagtaa tgtggtccac agccatgccc 
cctatcccat tctgtgtatg agtcccattt 
aaaagaatgt gctatgaagc tttctttcct 
ggtcttaant acaganctag ttttcagctg 
tgaaaggcaa atgattcagc tccttattac 



ctgttgcagc atgagttccc agcagcagaa 60 
gcagcagcag gtgaaacagc cttgccagcc 120 
caaggagccc tgccacccca aggtgcctga 180 
ccagcccaag gttccagagc catgccaccc 240 
cactccagca ccagcccagc agaanaccaa 300 
ttgaggagcc ggccaccana tgctgaatcc 360 
gccttgcaat tagcattctg tctcccccaa 420 
acacactctg agtctctgaa tgaagctgaa 480 
ctcagaattc tctgaagaaa agatttaaga 540 
cccattaaat tcnctttcaa ttccaaaaaa 600 



<210> 99 

<211> 667 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 345, 562, 635 
<223> n = A,T,C or G 



<400> 99 

actagtgact gagttcctgg caaagaaatt 
accatttaaa aaaatcagtg aaggatttga 
ggtcctgacg ttttgagatc caaagtggca 
tttctcttgt gagagttccc tcatctgaaa 
agtagaagat ttgttgaaga catagaaccc 
ttaaagtctt gtgagcacct gggaattagt 
attttgtaag gctataattg tatcttttaa 
tggagatttt taagagtttt aaccagctgc 
gtataaagat atagtaaatg catctcctag 
ttatttttta gatttgaata tnaatgttat 
attacatttt gaaatcagtt cattccatga 
cggaaaa 



tgacctggac cagttgataa ctcatgtttt 60 
gctgctcaat tcaggacaaa gcattcgaac 120 
ggaggtctgt gttgtcatgg tgaactggag 180 
tcatgtatct gtctcacaaa tacaagcata 240 
ttataaagaa ttattaacct ttataaacat 300 
ataataacaa tgttnatatt tttgatttac 360 
gaaaacatac cttggatttc tatgttgaaa 420 
tgcagatata ttactcaaaa cagatatagc 480 
agtaatattc acttaacaca ttggaaacta 540 
tttttaaaca cttgttatga gttacttggg 600 
tgcanattac tgggattaga ttaagaaaga 660 

667 



<210> 100 
<211> 583 
<212> DNA 
<213> Homo sapiens 
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<220> 

;<221> misc_feature 

<222> 404, 506, 514, 527, 528, 538, 548, 556, 568, 569 
<223> n « A,T,C or G 

<400> 100 

gttttgtttg taagatgatc acagtcatgt tacactgatc taaaggacat atatataacc 60 
ctttaaaaaa aaaatcactg cctcattctt atttcaagat gaatttctat acagactaga 120 
tgtttttctg aagatcaatt agacattttg aaaatgattt aaagtgtttt ccttaatgtt 180 
ctctgaaaac aagtttcttt tgtagtttta accaaaaaag tgcccttttt gtcactggat 240 
tctcctagca ttcatgattt ttttttcata caatgaaatt aaaattgcta aaatcatgga 300 
ctggctttct ggttggattt caggtaagat gtgtttaagg ccagagcttt tctcagtatt 360 
tgattttttt ccccaatatt tgatttttta aaaatataca catnggtgct gcatttatat 420 
ctgctggttt aaaattctgt catatttcac ttctagcctt ttagttatgg caaatcatat 480 
tttactttta cttaaagcat ttggtnattt ggantatctg gttctannct aaaaaaanta 540 
attctatnaa ttgaantttt ggtactcnnc catatttgga tec 593 

<210> 101 
<211> 592 
<212> DMA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 218, 497, 502, 533, 544, 546, 548, 550, 555 
<223> n - A,T,C or G 

<400> 101 

gtggagacgt acaaagagca gccgctcaag acacctggga agaaaaagaa aggcaagccc 60 
gggaaacgea aggagcagga aaagaaaaaa cggcgaactc gctctgcctg gttagactct 120 
ggagtgactg ggagtgggct agaaggggac cacctgtctg acacctccac aacgtcgctg 180 
gagctcgatt caeggaggea ttgaaatttt cagcaganac cttccaagga catattgeag 240 
gattctgtaa tagtgaacat atggaaagta ttagaaatat ttattgtctg taaatactgt 300 
aaatgcattg gaataaaact gtctccccca ttgctctatg aaactgeaca ttggtcattg 360 
tgaatatttt tttttttgee aaggctaatc caattattat' tatcacattt accataattt 420 
attttgtcca ttgatgtatt tattttgtaa atgtatcttg gtgctgctga atttctatat 480 
tttttgtaca taatgcnttt anatatacct atcaagtttg ttgataaatg aencaatgaa 540 
gtgnencnan ttggnggttg aatttaatga atgectaatt ttattatccc aa 592 

<210> 102 
<211> 587 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

t 2 ,l 2> Jt\' 131/ 256 ' 263 ' 332 ' 392 > 400 ' 463 > 461 ' «6, 497, 499, 

510, 511, 518, 519, 539, 554, 560, 576 

<223> n = A, T,C or G 

<400> 102 ! 

^f°! a f^ actta 9acta catcagggaa gaacacagac cacatccctg tcctcatgcg 60 
gcttatgttt tctggaagaa agtggagacc nagtccttgg ctttagggct ccccggctgg 120 
gggctgtgca ntccggtcag ggegggaagg gaaatgeace gctgcatgtg aacttacaqc 180 
ecaggeggat gccccttccc ttagcactac ctggcctcct gcatcccctc gcctcatgtt 240 
cctcccacct tcaaanaatg aanaacccca tgggcccagc cccttgccct ggggaaccaa 300 
ggcagccttc caaaactcag gggctgaagc anactattag ggcaggggct gactttgggt 360 
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gacactgccc attccctctc agggcagctc angtcacccn ggnctcttga acccagcctg 420 

ttcctttgaa aaagggcaaa actgaaaagg gcttttccta naaaaagaaa aaccagggaa 480 

ctttgccagg gcttcnntnt taccaaaacn ncttctcnng gatttttaat tccccattng 540 

gcctccactt accnggggcn atgccccaaa attaanaatt tcccatc 587 

<210> 103 
<211> 496 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 2, 17, 66, 74, 82, 119, 164, 166, 172, 200, 203, 228, 232, 
271, 273, 415, 423, 445, 446, 473 
<223> n = A, T, C or G 

<400> 103 

anaggactgg ccctacntgc 
ctgcanccct tggncactgc 
gcggtgggtc tccaccacaa 
actggcagga tggaccttan 
cccttaacat gatataatcc 
ttgcctacag aatttcattc 
tgggctgacc gcaaaaggtg 
gangcttgcc tcctccttct 
ggaaaagaaa caaaac 

<210> 104 
<211> 575 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 18, 19, 45, 68, 77, 132, 155, 174, 219, 226, 238, 259, 263, 
271, 273, 306, 323, 339, 363, 368, 370, 376, 381, 382, 436, 
440, 449, 450, 456, 481, 485, 496, 503, 510, 512, 515, 528, 
542, 552 

<223> n « A,T,C or G 
<400> 104 

gcacctgctc tcaatccnnc tctcaccatg atcctccgcc tgcanaaact cctctgccaa 60 
ctatggangt ggtttcnggg gtggctcttg ccaactggga agaagccgtg gtgtctctac 120 
ctgttcaact cngtttgtgt ctgggggatc aactnggggc tatggaagcg gctnaactgt 180 
tgttttggtg gaagggctgg taattggctt tgggaagtng cttatngaag ttggcctngg 240 
gaagttgcta ttgaaagtng ccntggaagt ngntttggtg gggggttttg ctggtggcct 300 
ttgttnaatt tgggtgcttt gtnaatggcg gccccctcnc ctgggcaatg aaaaaaatca 360 
ccnatgcngn aaacctcnac nnaacagcct gggcttccct cacctcgaaa aaagttgctc 420 
cccccccaaa aaaggncaan cccctcaann tggaangttg aaaaaatcct cgaatgggga 480 
ncccnaaaac aaaaancccc ccntttcccn gnaanggggg aaataccncc cccccactta 540 

cnaaaaccct tntaaaaaac cccccgggaa aaaaa * 575 

i 

<210> 105 
<211> 619 
<212> DNA 
<213> Homo sapiens 



tctctctcgt cctacctatc 
anatggaaac ctctcagtgt 
ccactttgac tctgtggtcc 
ccnacatatc cctctgttcc 
acccatgcaa ntngctactg 
agtctacact ttggcattct 
ccttacacac tggcccccac 
gattnncccc catgttggat 



aatgcccaac atggcagaac 60 
cttgacatca ccct accent 120 
ctgnanggtg gnttctcctg 180 
etctgetnag anaaagaatt 240 
gcccagctac catttaccat 300 
etctggegat agagtgtggc 360 
cctcaaccgt tgaencatea 420 
atcagggtgc tcnagggatt 480 

496 



<220> 
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<221> mis cofeature 

<222> 260, 527, 560, 564, 566/ 585, 599 
4 <223> n = A,T,C or G 

<400> 105 

cactagtagg atagaaacac tgtgtcccga gagtaaggag agaagctact attgattaga 60 
gcctaaccca ggttaactgc aagaagaggc gggatacttt cagctttcca tgtaactgta 120 
tgcataaagc caatgtagtc cagtttctaa gatcatgttc caagctaact gaatcccact 180 
tcaatacaca ctcatgaact cctgatggaa caataacagg cccaagcctg tggtatgatg 240 
tgcacacttg ctagactcan aaaaaatact actctcataa atgggtggga gtattttggt 300 
gacaacctac tttgcttggc tgagtgaagg aatgatattc atatattcat ttattccatg 360 
gacatttagt tagtgctttt tatataccag gcatgatgct gagtgacact cttgtgtata 420 
tttccaaatt tttgtacagt cgctgcacat atttgaaatc atatattaag acttccaaaa 480 
aatgaagtcc ctggtttttc atggcaactt gatcagtaaa ggattcncct ctgtttggta 540 
cttaaaacat ctactatatn gttnanatga aattcctttt ccccncctcc cgaaaaaana 600 
aagtggtggg gaaaaaaaa • 519 

<210> 106 
<211> 506 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 8, 21, 31, 32, 58, 75, 89, 96, 99, 103, 122, 126, 147, 150, 
158, 195, 210, 212, 219, 226, 246, 248, 249, 255, 258, 261, 
263, 265, 275, 304, 317, 321, 331, 337, 340, 358, 371, 377, 
380, 396, 450, 491 
<223> n « A, T,C or G 



<400> 106 

cattggtnct ttcatttgct ntggaagtgt 
gccttaaact ctgtnacact tttgggaant 
angtanagat gttctggata ccattanatn 
tatgtaaatg gtatntcatt cgctactatn 
gaatantnng cagcncanct nanangctgt 
acancattgt aacctcnatc nagtgagaca 
ttccaaatgg nctcatntcn aatgtttaaa 
atgttccacc aactagtacc tgtaatgacn 
gactgtggta ncccgcatcg gaaaaa 

<210> 107 
<211> 452 
<212> DNA 



nnatctctaa cagtggacaa agttcccngt 60 
gaaaanttng tantatgata ggttattctg 120 
tgcccccngt gtcagaggct catattgtgt 180 
antcaattng aaatanggtc tttgggttat 240 
ctgtngtatt cattgtggtc atagcacctc 300 
nactagnaan ttcctagtga tggctcanga 360 
agttanttaa gtgtaagaaa tacagactgg 420 
ggcctgtccc aacacatctc ccttttccat 480 

506 



<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 289, 317, 378 
<223> n — A, T,C or G 

<400> 107 1 
gttgagtctg tactaaacag taagatatct caatgaacca taaattcaac tttgtaaaaa 60 
tcttttgaag catagataat attgtttggt aaatgtttct tttgtttggt aaatgtttct 120 
tttaaagacc ctcctattct ataaaactct gcatgtagag gcttgtttac ctttctctct 180 
ctaaggttta caataggagt ggtgatttga aaaatataaa attatgagat tggttttcct 240 
gtggcataaa ttgcatcact gtatcatttt cttttttaac cggtaagant ttcagtttgt 300 
tggaaagtaa ctgtganaac ccagtttccc gtccatctcc cttagggact acccatagaa 360 
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catgaaaagg tccccacnga agcaagaaga taagtctttc atggctgctg gttgcttaaa 420 
ccactttaaa accaaaaaat tccccttgga aa 452 

<210> 108 

<211> 502 

<212> DNA 

<213> Homo sapiens 

<220> 

/ <221> mis cofeature 

<222> 22, 31, 126, 168, 183, 205, 219, 231, 236, 259, 283, 295, 
296, 298, 301, 340, 354, 378, 383, 409, 433, 446, 455, 466, 
488 

<223> n = A,T,C or G 



<400> 108 

atcttcttcc cttaattagt tnttatttat 

caaaaagaga ttgtagattg gcttctggct 

agaccncaac tgaagcttaa aaaatctatc 

tanagcatat aaaactttta acatntgctt 

aaaatgtccc tttaacatnc aatatcccac 

naaaaaaagg gtagaaggga tttaatgaaa 

ctccagaaca aaaacttntc aantctttca 

aaactccatt agncccactt tctaanggtc 

accctggnta ctcctgccct ca 



ntattaaatt ttattgcatg tcctggcaaa 60 
ccccaaaagc ccataacaga aagtaccaca 120 
acatgtataa tacctttnga agaacattaa 180 
aatgttgtnc aattataaaa ntaatngaaa 240 
atagtgttat ttnaggggat taccnngnaa 300 
actctgcttn ccatttctgt ttanaaacgt 360 
gctaaccgca tttgagctna ggccactcaa 420 
tctanagctt actaancctt ttgacccctt 480 

502 



<210> 109 

<211> 1308 

<212> DNA 

<213> Homo sapiens 



<400> 109 

acccgaggtc tcgctaaaat catcatggat 
tttgatcttt tcaaagagct gaagaaaaca 
ggcatcttga ctgcaattgg catggtcctc 
ttggaggagg tgtttcactc tgaaaaagag 
aaagaggtga ttgagaacac agaagcagta 
ataagcaaac tcactaatga ttatgaactg 
acatacctct tccttcaaaa atacttagat 
gaacctgttg attttgtaaa tgcagccgat 
gaaagcaaaa caaatgaaaa aatcaaggac 
accaagctgg tgctggtgaa catggtttat 
aaagaaaata ctaaggaaga gaaattttgg 
atgatgacac agagccattc ctttagcttc 
ctagggattc catataaaaa caacgaccta 
gatggcctgg agaagataat agataaaata 
ccagggcata tggaagaaag aaaggtgaat 
agttacgatc tagaggcggt cctggctgcc 
aaagccgact actcgggaat gtcgtcaggc 
agttcctttg tggcagtaac tgaggaaggc 
tttactgtca catccgcccc aggtcatgaa 
ttcatcaggc acaatgaatc caacagcatc 
gatgatcgtt gccatggcat tgctgctttt 
tgattatgaa aatcgtccat tcttttaaat 



tcacttggcg ccgtcagcac tcgacttggg 60 
aatgatggca acatcttctt ttcccctgtg 120 
ctggggacc<p gaggagccac cgcttcccag 180 
acgaagagct caagaataaa ggctgaagaa 240 
catcaacaat tccaaaagtt tttgactgaa 300 
aacataacca acaggctgtt tggagaaaaa 360 
tatgttgaaa aatattatca tgcatctctg 420 
gaaagtcgaa agaagattaa ttcctgggtt 480 
ttgttcccag atggctctat tagtagctct 540 
tttaaagggc aatgggacag ggagtttaag 600 
atgaataaga gcacaagtaa atctgtacag 660 
actttcctgg aggacttgca ggccaaaatt 720 
agcatgtttg tgcttctgcc caacgacatc 780 
agtcctgaga aattggtaga gtggactagt 840 
ctgcacttgc cccggtttga ggtggaggac 900 
atggggatgg gcgatgcctt cagtgagcac 960 
tccgggttgt acgcccagaa gttcctgcac 1020 
accgaggctg cagctgccac tggcataggc 1080 
aatgttcact gcaatcatcc qttcctgttc 1140 
ctcttcttcg gcagattttc ttctccttaa 1200 
agcaaaaaac aactaccagt gttactcata 1260 
ggtggctcac ttgcattt 1308 



<210> 110 
<211> 391 
<212> PRT 
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<213> Homo sapiens 



<<400> 110 

Met Asp Ser Leu Gly Ala Val Ser Thr Arg Leu Gly Phe Asp Leu Phe 

1 5 10 15 

Lys Glu Leu Lys Lys Thr Asn Asp Gly Asn He Phe Phe Ser Pro Val 

20 25 30 

Gly He Leu Thr Ala He Gly Met Val Leu Leu Gly Thr Arg Gly Ala 

35 40 45 

Thr Ala Ser Gin Leu Glu Glu Val Phe His Ser Glu Lys Glu Thr Lys 

50 55 60 

Ser Ser Arg He Lys Ala Glu Glu Lys Glu Val He Glu Asn Thr Glu 
65 70 75 80 

Ala Val His Gin Gin Phe Gin Lys Phe Leu Thr Glu He Ser Lys Leu 

85 90 95 

Thr Asn Asp Tyr Glu Leu Asn He Thr Asn Arg Leu Phe Gly Glu Lys 

100 105 " no 

Thr Tyr Leu Phe Leu Gin Lys Tyr Leu Asp Tyr Val Glu Lys Tyr Tvr 

115 120 125 

His Ala Ser Leu Glu Pro Val Asp Phe Val Asn Ala Ala Asp Glu Ser 
. 130 135 140 

Arg Lys Lys He Asn Ser Trp Val Glu Ser Lys Thr Asn Glu Lys He 
t 145 150 155 i 6 o 

Lys Asp Leu Phe Pro Asp Gly Ser He Ser Ser Ser Thr Lys Leu Val 

165 170 175 

Leu Val Asn Met Val Tyr Phe Lys Gly Gin Trp Asp Arg Glu Phe Lys 

180 185 190 

Lys Glu Asn Thr Lys Glu Glu Lys Phe Trp Met Asn Lys Ser Thr Ser 

195 200 205 

Lys Ser Val Gin Met Met Thr Gin Ser His Ser Phe Ser Phe Thr Phe 

21° 215 220 

Leu Glu Asp Leu Gin Ala Lys He Leu Gly He Pro Tyr Lys Asn Asn 
l 2S 230 235 240 

Asp Leu Ser Met Phe Val Leu Leu Pro Asn Asp He Asp Gly Leu Glu 

245 250 255 

Lys He He Asp Lys He Ser Pro Glu Lys Leu Val Glu Trp Thr Ser 

260 265 270 

Pro Gly His Met Glu Glu Arg Lys Val Asn Leu His Leu Pro Arg Phe 

275 280 285 

Glu Val Glu Asp Ser Tyr Asp Leu Glu Ala Val Leu Ala Ala Met Gly 

290 295 300 

Met Gly Asp Ala Phe Ser Glu His Lys Ala Asp Tyr Ser Gly Met Ser 
305 310 315 320 

Ser Gly Ser Gly Leu Tyr Ala Gin Lys Phe Leu His Ser Ser Phe Val 

325 330 335 

Ala Val Thr Glu Glu Gly Thr Glu Ala Ala Ala Ala Thr Gly He Gly 

340 345 350 

Phe Thr Val Thr Ser Ala Pro Gly His Glu Asn Val His Cys Asn His 

355 360 3 6 5 

Pro Phe Leu Phe Phe He Arg His Asn Glu Ser Asn Ser lie Leu Phe 

370 375 380 

Phe Gly Arg Phe Ser Ser Pro 1 
385 390 



<210> 111 
<211> 1419 
<212> DNA 
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<213> Homo sapiens 
«400> 111 

ggagaactat aaattaagga tcccagctac ttaattgact tatgcttcct agttcgttgc 60 

ccagccacca ccgtctctcc aaaaacccga ggtctcgcta aaatcatcat ggattcactt 120 

ggcgccgtca gcactcgact tgggtttgat cttttcaaag agctgaagaa aacaaatgat 180 

ggcaacatct tcttttcccc tgtgggcatc ttgactgcaa ttggcatggt cctcctgggg 240 

acccgaggag ccaccgcttc ccagttggag gaggtgtttc actctgaaaa agagacgaag 300 

agctcaagaa taaaggctga agaaaaagag gtggtaagaa taaaggctga aggaaaagag 360 

attgagaaca cagaagcagt acatcaacaa ttccaaaagt ttttgactga aataagcaaa 420 

ctcactaatg attatgaact gaacataacc aacaggctgt ttggagaaaa aacatacctc 480 

ttccttcaaa aatacttaga ttatgttgaa aaatattatc atgcatctct ggaacctgtt 540 

gattttgtaa atgcagccga tgaaagtcga aagaagatta attcctgggt tgaaagcaaa 600 

acaaatgaaa aaatcaagga cttgttccca gatggctcta ttagtagctc taccaagctg 660 

gtgctggtga acatggttta ttttaaaggg caatgggaca gggagtttaa gaaagaaaat 720 

actaaggaag agaaattttg gatgaataag agcacaagta aatctgtaca gatgatgaca 780 

cagagccatt cctttagctt cactttcctg gaggacttgc aggccaaaat tctagggatt 840 

ccatataaaa acaacgacct aagcatgttt gtgcttctgc ccaacgacat cgatggcctg 900 

gagaagataa tagataaaat aagtcctgag aaattggtag agtggactag tccagggcat 960 

atggaagaaa gaaaggtgaa tctgcacttg ccccggtttg aggtggagga cagttacgat 1020 

ctagaggcgg tcctggctgc catggggatg ggcgatgcct tcagtgagca caaagccgac 1080 

tactcgggaa tgtcgtcagg ctccgggttg tacgcccaga agttcctgca cagttccttt 1140 

gtggcagtaa ctgaggaagg caccgaggct gcagctgcca ctggcatagg ctttactgtc 1200 

acatccgccc caggtcatga aaatgttcac tgcaatcatc ccttcctgtt cttcatcagg 1260 

cacaatgaat ccaacagcat cctcttcttc ggcagatttt cttctcctta agatgatcgt 1320 

tgccatggca ttgctgcttt tagcaaaaaa caactaccag tgttactcat atgattatga 1380 

aaatcgtcca ttcttttaaa tggtggctca cttgcattt 1419 

<210> 112 
<211> 400 
<212> PRT 

<213>. Homo sapiens 
<400> 112 

Met Asp Ser Leu Gly Ala Val Ser Thr Arg Leu Gly Phe Asp Leu Phe 

1 5 10 5 . 15 

Lys Glu Leu Lys Lys Thr Asn Asp Gly Asn lie Phe Phe Ser Pro Val 

20 25 30 

Gly lie Leu Thr Ala lie Gly Met Val Leu Leu Gly Thr Arg Gly Ala 

35 40 45 

Thr Ala Ser Gin Leu Glu Glu Val Phe His Ser Glu Lys Glu Thr Lys 

50 55 60 

Ser Ser Arg lie Lys Ala Glu Glu Lys Glu Val Val Arg lie Lys Ala 
65 70 75 80 

Glu Gly Lys Glu lie Glu Asn Thr Glu Ala Val His Gin Gin Phe Gin 

85 90 95 

Lys Phe Leu Thr Glu lie Ser Lys Leu Thr Asn Asp Tyr Glu Leu Asn 

100 105 110 

lie Thr Asn Arg Leu Phe Gly Glu Lys Thr Tyr Leu Phe Leu Gin Lys 

115 120 125 

Tyr Leu Asp Tyr Val Glu Lys Tyr Tyr His Ala Ser Leu Glu Pro Val 

130 135 140 ' 

Asp Phe Val Asn Ala Ala Asp Glu Ser Arg Lys Lys lie Asn Ser Trp 
145 150 155 i , 160 

Val Glu Ser Lys Thr Asn Glu Lys He Lys Asp Leu Phe Pro Asp Gly 

165 170 175 

Ser He Ser Ser Ser Thr Lys Leu Val Leu Val Asn Met Val Tyr Phe 
180 185 190 
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Lys Gly Gin 
195 

^Lys Phe Trp 

210 
Gin Ser His 
225 

lie Leu Gly 

Leu Pro Asn 

Pro Glu Lys 
275 

Lys Val Asn 

290 
Leu Glu Ala 
305 

His Lys Ala 
Gin Lys Phe 

Glu Ala Ala 

355 

Gly His Glu 

370 
His Asn Glu 
385 



frp Asp Arg 

Met Asn Lys 

Ser Phe Ser 
230 

lie Pro Tyr 

245 
Asp He Asp 
260 

Leu Val Glu 

Leu His Leu 

Val Leu Ala 
310 

Asp Tyr Ser 

325 
Leu His Ser 
340 

Ala Ala Thr 

Asn Val His 

Ser Asn Ser 
390 



Glu Phe Lys 

200 
Ser Thr Ser 
215 

Phe Thr Phe 



Lys Glu Asn 



Lys Ser Val 
220 

Leu Glu Asp 
235 

Lys Asn Asn Asp Leu Ser 
250 

Lys He He 



Thr Lys 
205 

Gin Met 
Leu Gin 
Met Phe 



Gly Leu Glu 
265 

Trp Thr Ser 

280 
Pro Arg Phe 
295 

Ala Met Gly 

Gly Met Ser 

Ser Phe Val 
345 

Gly He Gly 

360 
Cys Asn His 
375 

He Leu Phe 



Asp Lys 
270 

Pro Gly His Met Glu 
285 

Asp Ser 



Glu Val Glu 
300 

Met Gly Asp 

315 
Ser Gly Ser 
330 

Ala Val Thr 

Phe Thr Val 

Pro Phe Leu 
380 

Phe Gly Arg 
395 



Ala Phe 

Gly Leu 

Glu Glu 
350 
Thr Ser 
365 

Phe Phe 
Phe Ser 



Glu Glu 

Met Thr 

Ala Lys 
240 
Val Leu 
255 

He Ser 

Glu Arg 

Tyr Asp 

Ser Glu 
320 
Tyr Ala 
335 

Gly Thr 

Ala Pro 

He Arg 

Ser Pro 
400 



<210> 113 
<211> 957 
<212> DNA 

<213> Homo sapiens 
<400> 113 

ctcgaccttc tctgcacagc ggatgaaccc tgagcagctg aagaccagaa aagccactat 60 
gactttctgc ttaattcagg agcttacagg attcttcaaa gagtgtgtcc agcatccttt 120 
gaaacatgag ttcttaccag cagaagcaga cctttacccc accacctcag cttcaacagc 180 
agcaggtgaa acaacccagc cagcctccac ctcaggaaat atttgttccc acaaccaagg 240 
agccatgcca ctcaaaggtt ccacaacctg gaaacacaaa gattccagag ccaggctgta 300 
ccaaggtccc tgagccaggc tgtaccaagg tccctgagcc aggttgtacc aaggtccctg 360 
agccaggatg taccaaggtc cctgagccag gttgtaccaa ggtccctgag ccaggctaca 420 
ccaaggtccc tgagccaggc agcatcaagg tccctgacca aggcttcatc aagtttcctg 480 
agccaggtgc catcaaagtt cctgagcaag gatacaccaa agttcctgtg ccaggctaca 540 
caaaggtacc agagccatgt ccttcaacgg tcactccagg cccagctcag cagaagacca 600 
agcagaagta atttggtgca cagacaagcc cttgagaagc caaccaccag atgctggaca 660 
ccctcttccc atctgtttct gtgtcttaat tgtctgtaga ccttgtaatc agtacattct 720 
caccccaagc catagtctct ctcttatttg tatcctaaaa atacggtact ataaagcttt 780 
tgttcacaca cactctgaag aatcctgtaa gcccctgaat taagcagaaa gtcttcatgg 840 
cttttctggt cttcggctgc tcagggttca tctgaagatt cgaatgaaaa gaaatgcatg 900 
tttcctgctc tgccctcatt aaattgcttt taattccaaa aaaaaaaaaa aaaaaaa 957 

<210> 114 i 
<211> 161 
<212> PRT 

<213> Homo sapiens 
<400> 114 

Met Ser Ser Tyr Gin Gin Lys Gin Thr Phe Thr Pro Pro Pro Gin Leu 
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1 




5 




10 


15 




Gin Gin 


Gin Gin Val Lys 


Gin Pro Ser Gin 


Pro Pro Pro Gin Glu 


He 




20 






25 


30 




Phe Val 


Pro Thr 


Thr Lys 


Glu Pro Cys His 


Ser Lys Val Pro Gin 


Pro 




35 






40 


45 




Gly Asn 


Thr Lys 


lie 


Pro 


Glu Pro Gly Cys 


Thr Lys Val Pro Glu 


Pro 


50 








55 


60 




Gly Cys 


Thr Lys 


Val 


Pro 


Glu Pro Gly Cys 


Thr Lys Val Pro Glu 


Pro 


65 






70 




75 


80 


Gly Cys 


Thr Lys 


Val 


Pro 


Glu Pro Gly Cys 


Thr Lys Val Pro Glu 


Pro 






85 




90 


95 




Gly Tyr 


Thr Lys 


Val 


Pro 


Glu Pro Gly Ser 


lie Lys Val Pro Asp 


Gin 




100 






105 


110 




Gly' Phe 


lie Lys 


Phe 


Pro 


Glu Pro Gly Ala 


lie Lys Val Pro Glu 


Gin 




115 






120 


125 




Gly Tyr 


Thr Lys 


Val 


Pro 


Val Pro Gly Tyr 


Thr Lys Val Pro Glu 


Pro 


130 








135 


140 




Cys Pro 


Ser Thr 


Val 


Thr 


Pro Gly Pro Ala 


Gin Gin Lys Thr Lys 


Gin 


145 






150 




155 


160 


Lys 















<210> 115 
<211> 506 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 8, 21, 31, 32, 58, 75, 89, 96, 99, 103, 122, 126, 147, 150, 
158, 195, 210, 212, 219, 226, 246, 248, 249, 255, 258, 261, 
263, 265, 275, 304, 317, 321, 331, 337, 340, 358, 371, 377, 
380, 396, 450, 491 
<223> n « A,T,C or G 

<400> 115 

cattggtnct ttcatttgct 
gccttaaact ctgtnacact 
angtanagat gttctggata 
tatgtaaatg gtatntcatt 
gaatantnng cagcncanct 
acancattgt aacctcnatc 
ttccaaatgg nctcatntcn 
atgttccacc aactagtacc 
gactgtggta ncccgcatcg 

<210> 116 
<211> 3079 
<212> DNA 

<213> Homo sapiens 
<400> 116 

ggatccccgg gtttcctaaa 
ggtcaaaggg cagaaaaaat 
aaagaggtca aagtggttta 
cttgcaggca gatctgccca 
tgtgcacaaa aggatgaaac 



ntggaagtgt 
tttgggaant 
ccattanatn 
cgctactatn 
nanangctgt 
nagtgagaca 
aatgtttaaa 
tgtaatgacn 
gaaaaa 



nnatctctaa 
gaaaanttng 
tgcccccngt 
antcaattng 
ctgtngtatt 
nactagnaan 
agttanttaa 
ggcctgtccc 



cagtggacaa 
tantatgata 
gtcagaggct 
aaatanggtc 
cattgtggtc 
ttcctagtga 
gtgtaagaaa 
aacacatctc 



agttcccngt 60 
ggttattctg 120 
catattgtgt 180 
tttgggttat 240 
atagcacctc 300 
tggctcanga 360 
tacagactgg 420 
ccttttccat 480 
506 



ccccccacag 
gctgagttag 
tagggggcgc 
gtgggctctg 
tctattttcc 



agtcctgccc 
gaggagctat 
tgagggcttc 
ggatagctgt 
ctctagcaca 



aggccaaaga 
ggaaggataa 
ccacattctc 
gccttcccta 
taaccaagaa 



gcaaggaaaa 60 
acctggcctt 120 
tggcctaaac 180 
acaaaaaaat 240 
tataaggcta 300 
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cagattgcct ttcccagagg gaaaaccctg cagcaacctg ctgcctggaa aagtgtaaga 360 
gcagatcact ggggaatcgt ttgccccccg ctgatggaca gcttccccaa gctccaaggg 420 
<caggtgctca gcatgtaccg tactgggatg gttgtcaata ctcctggtcc tgtaagagtc 480 
ccaggacact gccatgccaa tgccccctca gttcctggca tcctttttgg gctgctcaca 540 
gccccagcct ctatggtgaa gacatacttg ctagcagcgt caccaacttg ttgccaagag 600 
atcagtgctc gaaggcaagg ttatttctaa ctgagcagag cctgccagga agaaagcgtt 660 
tgcaccccac accactgtgc aggtgtgacc ggtgagctca cagctgcccc ccaggcatgc 720 
ccagcccact taatcatcac agctcgacag ctctctcgcc cagcccagtt ctggaaggga 780 
taaaaagggg catcaccgtt cctgggtaac agagccacct tctgcgtcct gctgagctct 840 
gttctctcca gcacctccca acccactagt gcctggttct cttgctccac caggaacaag 900 
ccaccatgtc tcgccagtca agtgtgtctt ccggagcggg gggcagtcgt agcttcagca 960 
ccgcctctgc catcaccccg tctgtctccc gcaccagctt cacctccgtg tcccggtccg 1020 
ggggtggcgg tggtggtggc ttcggcaggg tcagccttgc gggtgcttgt ggagtgggtg 1080 
gctatggcag ccggagcctc tacaacctgg ggggctccaa gaggatatcc atcagcacta 1140 
gtggtggcag cttcaggaac cggtttggtg ctggtgctgg aggcggctat ggctttggag 1200 
gtggtgccgg tagtggattt ggtttcggcg gtggagctgg tggtggcttt gggctcggtg 12 60 
gcggagctgg ctttggaggt ggcttcggtg gccctggctt tcctgtctgc cctcctggag 1320 
gtatccaaga ggtcactgtc aaccagagtc tcctgactcc cctcaacctg caaatcgacc 1380 
ccagcatcca gagggtgagg accgaggagc gcgagcagat caagaccctc aacaataagt 1440 
ttgcctcctt catcgacaag gtgcggttcc tggagcagca gaacaaggtt ctggaaacaa 1500 
agtggaccct gctgcaggag cagggcacca agactgtgag gcagaacctg gagccgttgt 1560 
tcgagcagta catcaacaac ctcaggaggc agctggacag catcgtgggg gaacggggcc 1620 
gcctggactc agagctgaga aacatgcagg acctggtgga agacttcaag aacaagtatg 1680 
aggatgaaat caacaagcgt accactgctg agaatgagtt tgtgatgctg aagaaggatg 1740 
tagatgctgc ctacatgaac aaggtggagc tggaggccaa ggttgatgca ctgatggatg 1800 
agattaactt catgaagatg ttctttgatg cggagctgtc ccagatgcag acgcatgtct 1860 
ctgacacctc agtggtcctc tccatggaca acaaccgcaa cctggacctg gatagcatca 1920 
tcgctgaggt caaggcccag tatgaggaga ttgccaaccg cagccggaca gaagccgagt 1980 
cctggtatca gaccaagtat gaggagctgc agcagacagc tggccggcat ggcgatgacc 2040 
tccgcaacac caagcatgag atctctgaga tgaaccggat gatceagagg ctgagagccg 2100 
agattgacaa tgtcaagaaa cagtgcgcca atctgcagaa cgccattgcg gatgccgagc 2160 
agcgtgggga gctggccctc aaggatgcca ggaacaagct ggccgagctg gaggaggccc 2220 
tgcagaaggc caagcaggac atggcccggc tgctgcgtga gtaccaggag ctcatgaaca 2280 
ccaagctggc cctggacgtg gagatcgcca cttaccgcaa gctgctggag ggcgaggaat 2340 
gcagactcag tggagaagga gttggaccag tcaacatctc, tgttgtcaca agcagtgttt 2400 
cctctggata tggcagtggc agtggctatg gcggtggcct cggtggaggt cttggcggcg 24 60 
gcctcggtgg aggtcttgcc ggaggtagca gtggaagcta ctactccagc agcagtgggg 2520 
gtgtcggcct aggtggtggg ctcagtgtgg ggggctctgg cttcagtgca agcagtagcc 2580 
gagggctggg ggtgggcttt ggcagtggcg ggggtagcag ctccagcgtc aaatttgtct 2640 
ccaccacctc ctcctcccgg aagagcttca agagctaaga acctgctgca agtcactgcc 2700 
ttccaagtgc agcaacccag cccatggaga ttgcctcttc taggcagttg ctcaagccat 2760 
gttttatcct tttctggaga gtagtctaga ccaagccaat tgcagaacca cattctttgg 2820 
ttcccaggag agccccattc ccagcccctg gtctcccgtg ccgcagttct atattctgct 2880 
tcaaatcagc cttcaggttt cccacagcat ggcccctgct gacacgagaa cccaaagttt 2940 
tcccaaatct aaatcatcaa aacagaatcc ccaccccaat cccaaatttt gttttggttc 3000 
taactacctc cagaatgtgt tcaataaaat gttttataat ataagctggt gtgcagaatt 3060 
gttttttttt tctacccaa 3079 

<210> 117 
<211> 6921 
<212> DNA 

<213> Homo sapiens ! 
<400> 117 

gaattctgac tgtccactca aaacttctat tccgatcaaa gctatctgtg actacagaca 60 
aattgagata accatttaca aagacgatga atgtgttttg gcgaataact ctcatcgtgc 120 
taaatggaag gtcattagtc ctactgggaa tgaggctatg gtcccatctg tgtgcttcac 180 
cgttcctcca ccaaacaaag aagcggtgga ccttgccaac agaattgagc aacagtatca 240 



WO 02/00174 PCT/US01/21065 

50 

gaatgtcctg actctttggc atgagtctca cataaacatg aagagtgtag tatcctggca 300 
ttatctcatc aatgaaattg atagaattcg agctagcaat gtggcttcaa taaagacaat 360 
gctacctggt gaacatcagc aagttctaag taatctacaa tctcgttttg aagattttct 420 
ggaagatagc caggaatccc aagtcttttc aggctcagat ataacacaac tggaaaagga 480 
ggttaatgta tgtaagcagt attatcaaga acttcttaaa tctgcagaaa gagaggagca 540 
agaggaatca gtttataatc tctacatctc tgaagttcga aacattagac ttcggttaga 600 
gaactgtgaa gatcggctga ttagacagat tcgaactccc ctggaaagag atgatttgca 660 
tgaaagtgtg ttcagaatca cagaacagga gaaactaaag aaagagctgg aacgacttaa 720 
agatgatttg ggaacaatca caaataagtg tgaggagttt ttcagtcaag cagcagcctc 780 
/ ttcatcagtc cctaccctac gatcagagct taatgtggtc cttcagaaca tgaaccaagt 840 
ctattctatg tcttccactt acatagataa gttgaaaact gttaacttgg tgttaaaaaa 900 
cactcaagct gcagaagccc tcgtaaaact ctatgaaact aaactgtgtg aagaagaagc 960 
agttatagct gacaagaata atattgagaa tctaataagt actttaaagc aatggagatc 1020 
tgaagtagat gaaaagagac aggtattcca tgccttagag gatgagttgc agaaagctaa 1080 
agccatcagt gatgaaatgt ttaaaacgta taaagaacgg gaccttgatt ttgactggca 1140 
caaagaaaaa gcagatcaat tagttgaaag gtggcaaaat gttcatgtgc agattgacaa 1200 
caggttacgg gacttagagg gcattggcaa atcactgaag tactacagag acacttacca 1260 
tcctttagat gattggatcc agcaggttga aactactcag agaaagattc aggaaaatca 1320 
gcctgaaaat agtaaaaccc tagccacaca gttgaatcaa cagaagatgc tggtgtccga 1380 
aatagaaatg aaacagagca aaatggacga gtgtcaaaaa tatgcagaac agtactcagc 1440 
tacagtgaag gactatgaat tacaaacaat gacctaccgg gccatggtag attcacaaca 1500 
aaaatctcca gtgaaacgcc gaagaatgca gagttcagca gatctcatta ttcaagagtt 1560 
catggaccta aggactcgat atactgccct ggtcactctc atgacacaat atattaaatt 1620 
tgctggtgat tcattgaaga ggctggaaga ggaggagatt aaaaggtgta aggagacttc 1680 
tgaacatggg gcatattcag atctgcttca gcgtcagaag gcaacagtgc ttgagaatag 1740 
caaacttaca ggaaagataa gtgagttgga aagaatggta gctgaactaa agaaacaaaa 1800 
gtcccgagta gaggaagaac ttccgaaggt cagggaggct gcagaaaatg aattgagaaa 1860 
gcagcagaga aatgtagaag atatctctct gcagaagata agggctgaaa gtgaagccaa 1920 
gcagtaccgc agggaacttg aaaccattgt gagagagaag gaagccgctg aaagagaact 1980 
ggagcgggtg aggcagctca ccatagaggc cgaggctaaa agagctgccg tggaagagaa 2040 
cctcctgaat tttcgcaatc agttggagga aaacaccttt accagacgaa cactggaaga 2100 
tcatcttaaa agaaaagatt taagtctcaa tgatttggag caacaaaaaa ataaattaat 2160 
ggaagaatta agaagaaaga gagacaatga * ggaagaactc ttgaagctga taaagcagat 2220 
ggaaaaagac cttgcatttc agaaacaggt agcagagaaa cagttgaaag aaaagcagaa 2280 
aattgaattg gaagcaagaa gaaaaataac tgaaattcag tatacatgta gagaaaatgc 2340 
attgccagtg tgtccgatca cacaggctac atcatgcagg gcagtaacgg gtctccagca 2400 
agaacatgac aagcagaaag cagaagaact caaacagcag gtagatgaac taacagctgc 2460 
caatagaaag gctgaacaag acatgagaga gctgacatat gaacttaatg ccctccagct 2520 
tgaaaaaacg tcatctgagg aaaaggctcg tttgctaaaa gataaactag atgaaacaaa 2580 
taatacactc agatgcctta agttggagct ggaaaggaag gatcaggcgg agaaagggta 2640 
ttctcaacaa ctcagagagc ttggtaggca attgaatcaa accacaggta aagctgaaga 2700 
agccatgcaa gaagctagtg atctcaagaa aataaagcgc aattatcagt tagaattaga 2760 
atctcttaat catgaaaaag ggaaactaca aagagaagta gacagaatca caagggcaca 2820 
tgctgtagct gagaagaata ttcagcattt aaattcacaa attcattctt ttcgagatga 2880 
gaaagaatta gaaagactac aaatctgcca gagaaaatca gatcatctaa aagaacaatt 2940 
tgagaaaagc catgagcagt tgcttcaaaa tatcaaagct gaaaaagaaa ataatgataa 3000 
aatccaaagg ctcaatgaag aattggagaa aagtaatgag tgtgcagaga tgctaaaaca 3060 
aaaagtagag gagcttacta ggcagaataa tgaaaccaaa ttaatgatgc agagaattca 3120 
ggcagaatca gagaatatag ttttagagaa acaaactatc cagcaaagat gtgaagcact 3180 
gaaaattcag gcagatggtt ttaaagatca gctacgcagc acaaatgaac acttgcataa 3240 
acagacaaaa acagagcagg attttcaaag aaaaattaaa tgcctagaag aagacctggc 3300 
gaaaagtcaa aatttggtaa gtgaatttaa gcaaaagtgt gaccaacaga acattatcat 3360 
ccagaatacc aagaaagaag ttagaaatct gaatgcggaa ctgaatgctt ccaaagaaga 3420 
gaagcgacgc ggggagcaga aagttcagct acaacaagct caggtgcaag agttaaataa 3480 
caggttgaaa aaagtacaag acgaattaca cttaaagacc atagaggagc agatgaccca 3540 
cagaaagatg gttctgtttc aggaagaatc tggtaaattc aaacaatcag cagaggagtt 3600 
tcggaagaag atggaaaaat taatggagtc caaagtcatc actgaaaatg atatttcagg 3660 
cattaggctt gactttgtgt ctcttcaaca agaaaactct agagcccaag aaaatgctaa 3720 
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gctttgtgaa acaaacatta aagaacttga aagacagctt caacagtatc gtgaacaaat 3780 
gcagcaaggg cagcacatgg aagcaaatca ttaccaaaaa tgtcagaaac ttgaggatga 3840 
^gctgatagcc cagaagcgtg aggttgaaaa cctgaagcaa aaaatggacc aacagatcaa 3900 
agagcatgaa catcaattag ttttgctcca gtgtgaaatt caaaaaaaga gcacagccaa 3960 
agactgtacc ttcaaaccag attttgagat gacagtgaag gagtgccagc actctggaga 4020 
gctgtcctct agaaacactg gacaccttca cccaacaccc agatcccctc tgttgagatg 4080 
gactcaagaa ccacagccat tggaagagaa gtggcagcat cgggttgttg aacagatacc 4140 
caaagaagtc caattccagc caccaggggc tccactcgag aaagagaaaa gccagcagtg 4200 
ttactctgag tacttttctc agacaagcac cgagttacag ataacttttg atgagacaaa 4260 
ccccattaca agactgtctg aaattgagaa gataagagac caagccctga acaattctag 4320 
accacctgtt aggtatcaag ataacgcatg tgaaatggaa ctggtgaagg ttttgacacc 4380 
cttagagata gctaagaaca agcagtatga tatgcataca gaagtcacaa cattaaaaca 4440 
agaaaagaac ccagttccca gtgctgaaga atggatgctt gaagggtgca gagcatctgg 4500 
tggactcaag aaaggggatt tccttaagaa gggcttagaa ccagagacct tccagaactt 4560 
tgatggtgat catgcatgtt cagtcaggga tgatgaattt aaattccaag ggcttaggca 4 620 
cactgtgact gccaggcagt tggtggaagc taagcttctg gacatgagaa caattgagca 4680 
gctgcgactc ggtcttaaga ctgttgaaga agttcagaaa actcttaaca agtttctgac 4740 
gaaagccacc tcaattgcag ggctttacct agaatctaca aaagaaaaga tttcatttgc 4800 
ctcagcggcc gagagaatca taatagacaa aatggtggct ttggcatttt tagaagctca 4860 
ggctgcaaca ggttttataa ttgatcccat ttcaggtcag acatattctg ttgaagatgc 4920 
agttcttaaa ggagttgttg accccgaatt cagaattagg cttcttgagg cagagaaggc 4980 
agctgtggga tattcttatt cttctaagac attgtcagtg tttcaagcta tggaaaatag 5040 
aatgcttgac agacaaaaag gtaaacatat cttggaagcc cagattgcca gtgggggtgt 5100 
cattgaccct gtgagaggca ttcgtgttcc tccagaaatt gctctgcagc aggggttgtt 5160 
gaataatgcc atcttacagt ttttacatga gccatccagc aacacaagag ttttccctaa 5220 
tcccaataac aagcaagctc tgtattactc agaattactg cgaatgtgtg tatttgatgt 5280 
agagtcccaa tgctttctgt ttccatttgg ggagaggaac atttccaatc tcaatgtcaa 5340 
gaaaacacat agaatttctg tagtagatac taaaacagga tcagaattga ccgtgtatga 5400 
ggctttccag agaaacctga ttgagaaaag tatatatctt gaactttcag ggcagcaata 5460 
tcagtggaag gaagctatgt tttttgaatc ctatgggcat tcttctcata tgctgactga 5520 
tactaaaaca ggattacact tcaatattaa tgaggctata gagcagggaa caattgacaa 5580 
agccttggtc.aaaaagtatc aggaaggcct catcacactt acagaacttg ctgattcttt 5640 
gctgagccgg ttagtcccca agaaagattt gcacagtcct gttgcagggt attggctgac 5700 
tgctagtggg gaaaggatct ctgtactaaa agcctcccgt agaaatttgg ttgatcggat 5760 
tactgccctc cgatgccttg aagcccaagt cagtacaggg ggcataattg atcctcttac 5820 
tggcaaaaag taccgggtgg ccgaagcttt gcatagaggc ctggttgatg aggggtttgc 5880 
ccagcagctg cgacagtgtg aattagtaat cacagggatt ggccatccca tcactaacaa 5940 
aatgatgtca gtggtggaag ctgtgaatgc aaatattata aataaggaaa tgggaatccg 6000 
atgtttggaa tttcagtact tgacaggagg gttgatagag ccacaggttc actctcggtt 6060 
atcaatagaa gaggctctcc aagtaggtat tatagatgtc ctcattgcca caaaactcaa 6120 . 
agatcaaaag tcatatgtca gaaatataat atgccctcag acaaaaagaa agttgacata 6180 
taaagaagcc ttagaaaaag ctgattttga tttccacaca ggacttaaac tgttagaagt 6240 
atctgagccc ctgatgacag gaatttctag cctctactat tcttcctaat gggacatgtt 6300 
taaataactg tgcaaggggt gatgcaggct ggttcatgcc actttttcag agtatgatga 6360 
tatcggctac atatgcagtc tgtgaattat gtaacatact ctatttcttg agggctgcaa 6420 
attgctaagt gctcaaaata gagtaagttt taaattgaaa attacataag atttaatgcc 6480 
cttcaaatgg tttcatttag ccttgagaat ggttttttga aacttggcca cactaaaatg 6540 
tttttttttt tttacgtaga atgtgggata aacttgatga actccaagtt cacagtgtca 6600 
tttcttcaga actccccttc attgaatagt gatcatttat taaatgataa attgcactcg 6660 
ctgaaagagc acgtcatgaa gcaccatgga atcaaagaga aagatataaa ttcgttccca 6720 
cagccttcaa gctgcagtgt tttagattgc ttcaaaaaat gaaaaagttt tgcctttttc 6780 
gatatagtga ccttctttgc atattaaaat gtttaccaca atgtcccatt tctagttaag 6840 
tcttcgcact tgaaagctaa cattatgaat attatgtgtt ggaggagggg aaggattttc 6900 
ttcattctgt gtattttccg g 6921 



<210> 118 
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<211> 946 
<212> DNA 
<213> Homo sapiens 

<400> 118 

cttctgactg ggctcaggct gacaggtaga gctcaccatg gcttcttgtg tccttgtccc 60 
ctccccatca cagctgtggt gcagtccacc gtctccagtg gctatggcgg tgccagtggt 120 
gtcggcagtg gcttaggcct gggtggagga agcagctact cctatggcag tggtcttggc 180 
gttggaggtg gcttcagttc cagcagtggc agagccattg ggggtggcct cagctctgtt 240 
ggaggcggca gttccaccat caagtacacc accacctcct cctccagcag gaagagctat 300 
aagcactaaa gtgcgtctgc tagctctcgg tcccacagtc ctcaggcccc tctctggctg 360 
cagagccctc tcctcaggtt gcctgtcctc tcctggcctc cagtctcccc tgctgtccca 420 
ggtagagctg gggatgaatg cttagtgccc tcacttcttc tctctctctc tataccatct 480 
gagcacccat tgctcaccat cagatcaacc tctgatttta catcatgatg taatcaccac 540 
tggagcttca ctgttactaa attattaatt tcttgcctcc agtgttctat ctctgaggct 600 
gagcattata agaaaatgac ctctgctcct tttcattgca gaaaattgcc aggggcttat 660 
ttcagaacaa cttccactta ctttccactg gctctcaaac tctctaactt ataagtgttg 720 
tgaaccccca cccaggcagt atccatgaaa gcacaagtga ctagtcctat gatgtacaaa 780 
gcctgtatct ctgtgatgat ttctgtgctc ttcactgttt gcaattgcta aataaagcag 840 
atttataata catatattct tttactttgc cttgctttgg ggccaaagtt ttgggcttaa 900 
acttttttat ctgataagtg aatagttgtt tttaaaagat aatcta 946 

<210> 119 

<211> 8948 

<212> DNA 

<213> Homo sapiens 

<400> 119 

tcaacagccc ctgctccttg ggcccctcca tgccatgccg taatctctcc cacccgacca 60 
acaccaacac ccagctccga cgcagctcct ctgcgccctt gccgccctfcc gagccacagc 120 
tttcctcccg ctcctgcccc cggcccgtcg ccgtctccgc gctcgcagcg gcctcgggag 180 
ggcccaggta gcgagcagcg acctcgcgag ccttccgcac tcccgcccgg ttccccggcc 240 
gtccgcctat ccttggcccc ctccgctttc tccgcgccgg cccgcctcgc ttatgcctcg 300 
gcgctgagcc gctctcccga ttgcccgccg acatgagctg caacggaggc tcccacccgc 360 
ggatcaacac tctgggccgc atgatccgcg ccgagtctgg cccggacctg cgctacgagg 420 
tgaccagcgg cggcgggggc accagcagga tgtactattc tcggcgcggc gtgatcaccg 480 
accagaactc ggacggctac tgtcaaaccg gcacgatgtc caggcaccag aaccagaaca 540 
ccatccagga gctgctgcag aactgctccg actgcttgat gcgagcagag ctcatcgtgc 600 
agcctgaatt, gaagtatgga gatggaatac aactgactcg gagtcgagaa ttggatgagt 660 
gttttgccca ggccaatgac caaatggaaa tcctcgacag cttgatcaga gagatgcggc 720 
agatgggcca gccctgtgat gcttaccaga aaaggcttct tcagctccaa gagcaaatgc 780 
gagcccttta taaagccatc agtgtccctc gagtccgcag ggccagctcc aagggtggtg 840 
gaggctacac ttgtcagagt ggctctggct gggatgagtt caccaaacat gtcaccagtg 900 
aatgtttggg gtggatgagg cagcaaaggg cggagatgga catggtggcc tggggtgtgg 960 
acctggcctc agtggagcag cacattaaca gccaccgggg catccacaac tccatcggcg 1020 
actatcgctg gcagctggac aaaatcaaag ccgacctgcg cgagaaatct gcgatctacc 1080 
agttggagga ggagtatgaa aacctgctga aagcgtcctt tgagaggatg gatcacctgc 1140 
gacagctgca gaacatcatt caggccacgt ccagggagat catgtggatc aatgactgcg 1200 
aggaggagga gctgctgtac gactggagcg acaagaacac caacatcgct cagaaacagg 1260 
aggccttctc catacgcatg agtcaactgg aagttaaaga aaaagagctc aataagctga 1320 
aacaagaaag tgaccaactt gtcctcaatc agcatccagc ttcagacaaa ajttgaggcct 1380 
atatggacac tctgcagacg cagtggagtt ggattcttca gatcaccaag tgcattgatg 1440 
ttcatctgaa agaaaatgct gcctactttc agttttttga agaggcgcag tctactgaag 1500 
catacctgaa ggggctccag gactccatca ggaagaagta cccctgcgac aagaacatgc 1560 
ccctgcagca cctgctggaa cagatcaagg agctggagaa agaacgagag aaaatccttg 1620 
aatacaagcg tcaggtgcag aacttggtaa acaagtctaa gaagattgta cagctgaagc 1680 
ctcgtaaccc agactacaga agcaataaac ccattattct cagagctctc tgtgactaca 1740 
aacaagatca gaaaatcgtg cataaggggg atgagtgtat cctgaaggac aacaacgagc 1800 
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gcagcaagtg ^tacgtgacg ggcccgggag gcgttgacat gcttgttccc tctgtggggc 1860 
tgatcatccc tcctccgaac ccactggccg tggacctctc ttgcaagatt gagcagtact 1920 
, acgaagccat cttggctctg tggaaccagc tctacatcaa catgaagagc ctggtgtcct 1980 
ggcactactg catgattgac atagagaaga tcagggccat gacaatcgcc aagctgaaaa 2040 
caatgcggca ggaagattac atgaagacga tagccgacct tgagttacat taccaagagt 2100 
tcatcagaaa tagccaaggc tcagagatgt ttggagatga tgacaagcgg aaaatacagt 2160 ' 
ctcagttcac cgatgcccag aagcattacc agaccctggt cattcagctc cctggctatc 2220 
cccagcacca gacagtgacc acaactgaaa tcactcatca tggaacctgc caagatgtca 2280 
accataataa agtaattgaa accaacagag aaaatgacaa gcaagaaaca tggatgctga 2340 
tggagctgca gaagattcgc aggcagatag agcactgcga gggcaggatg actctcaaaa 2400 
acctccctct agcagaccag gggtcttctc accacatcac agtgaaaatt aacgagctta 2460 
agagtgtgca gaatgattca caagcaattg ctgaggttct caaccagctt aaagatatgc 2520 
ttgccaactt cagaggttct gaaaagtact gctatttaca gaatgaagta tttggactat 2580 
ttcagaaact ggaaaatatc aatggtgtta cagatggcta cttaaatagc ttatgcacag 2640 
taagggcact gctccaggct attctccaaa cagaagacat gttaaaggtt tatgaagcca 2700 
ggctcactga ggaggaaact gtctgcctgg acctggataa agtggaagct taccgctgtg 2760 
gactgaagaa aataaaaaat gacttgaact tgaagaagtc gttgttggcc actatgaaga 2820 
cagaactaca gaaagcccag cagatccact ctcagacttc acagcagtat ccactttatg 2880 
atctggactt gggcaagttc ggtgaaaaag tcacacagct gacagaccgc tggcaaagga 2940 
tagataaaca gatcgacttt agattatggg acctggagaa acaaatcaag caattgagga 3000 
attatcgtga taactatcag gctttctgca agtggctcta tgatcgtaaa cgccgccagg 3060 
attccttaga atccatgaaa tttggagatt ccaacacagt catgcggttt ttgaatgagc 3120 
agaagaactt gcacagtgaa atatctggca aacgagacaa atcagaggaa gtacaaaaaa 3180 
ttgctgaact ttgcgccaat tcaattaagg attatgagct ccagctggcc tcatacacct 3240 
caggactgga aactctgctg aacataccta tcaagaggac catgattcag tccccttctg 3300 
gggtgattct gcaagaggct gcagatgttc atgctcggta cattgaacta cttacaagat 3360 
ctggagacta ttacaggttc ttaagtgaga tgctgaagag tttggaagat ctgaagctga 3420 
aaaataccaa gatcgaagtt ttggaagagg agctcagact ggcccgagat gccaactcgg 3480 
aaaactgtaa taagaacaaa ttcctggatc agaacctgca gaaataccag gcagagtgtt 3540 
cccagttcaa agcgaagctt gcgagcctgg aggagctgaa gagacaggct gagctggatg 3600 
ggaagtcggc taagcaaaat ctagacaagt gctacggcca aataaaagaa ctcaatgaga 3660 
agatcacccg actgacttat gagattgaag atgaaaagag aagaagaaaa tctgtggaag 3720 
acagatttga ccaacagaag aatgactatg accaactgca gaaagcaagg caatgtgaaa 3780 
aggagaacct tggttggcag aaattagagt ctgagaaagc catcaaggag aaggagtacg 3840 
agattgaaag gttgagggtt ctactgcagg aagaaggcac ccggaagaga gaatatgaaa 3900 
atgagctggc aaaggtaaga aaccactata atgaggagat gagtaattta aggaacaagt 3960 
atgaaacaga gattaacatt acgaagacca ccatcaagga gatatccatg caaaaagagg 4020 
atgattccaa aaatcttaga aaccagcttg atagactttc aagggaaaat cgagatctga 4080 
aggatgaaat tgtcaggctc aatgacagca tcttgcaggc cactgagcag cgaaggcgag 4140 
ctgaagaaaa cgcccttcag caaaaggcct gtggctctga gataatgcag aagaagcagc 4200 
atctggagat agaactgaag caggtcatgc agcagcgctc tgaggacaat gcccggcaca 4260 
agcagtccct ggaggaggct gccaagacca ttcaggacaa aaataaggag atcgagagac 4320 
tcaaagctga gtttcaggag gaggccaagc gccgctggga atatgaaaat gaactgagta 4380 
aggtaagaaa caattatgat gaggagatca ttagcttaaa aaatcagttt gagaccgaga 4440 
tcaacatcac caagaccacc atccaccagc tcaccatgca gaaggaagag gataccagtg 4500 
gctaccgggc tcagatagac aatctcaccc gagaaaacag gagcttatct gaagaaataa 4560 
agaggctgaa gaacactcta acccagacca cagagaatct caggagggtg gaagaagaca 4620 
tccaacagca aaaggccact ggctctgagg tgtctcagag gaaacagcag ctggaggttg 4680 
agctgagaca agtcactcag atgcgaacag aggagagcgt aagatataag caatctcttg 4740 
atgatgctgc caaaaccatc caggataaaa acaaggagat agaaaggtta aaacaactga 4800 
tcgacaaaga aacaaatgac cggaaatgcc tggaagatga aaacgcgaga ttacaaaggg 4860 
tccagtatga cctgcagaaa gcaaacagta gtgcgacgga gacaataaac aWactgaagg 4920 
ttcaggagca agaactgaca cgcctgagga tcgactatga aagggtttcc caggagagga 4980 
ctgtgaagga ccaggatatc acgcggttcc agaactctct gaaagagctg cagctgcaga 5040 
agcagaaggt ggaagaggag ctgaatcggc tgaagaggac cgcgtcagaa gactcctgca 5100 
agaggaagaa gctggaggaa gagctggaag gcatgaggag gtcgctgaag gagcaagcca 5160 
tcaaaatcac caacctgacc cagcagctgg agcaggcatc cattgttaag aagaggagtg 5220 
aggatgacct ccggcagcag agggacgtgc tggatggcca cctgagggaa aagcagagga 5280 
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cccaggaaga gctgaggagg ctctcttctg 
aggaacagga aagtgtcaaa caagctcact 
^agataaaag cagaagctta aatgaaagca 
cagagaacct gaccaaggag cacttgatgt 
agtacgatga cctgaggaga ggacgaagcg 
tggaactaag gagccagctg cagatcagca 
ttaatgattt acagagagag agggaaaatt 
aggctttaga ggcatctaat aggattcagg 
aggaaagaga gagccttctg gtgaaaatca 
agaggctgga ggatgagctg aatcgtgcaa 
aacagcgcct ggagtgtgag aaacagcaaa 
aatattcccg caaggaggag gctattagga 
gagagaagaa cagtcttagg agtgagatcg 
aagagaggtg caggcgtaag ctggaggatt 
cagaacgctc ccgatatcag agggagattg 
atcgagagac ccagactgag tgtgagtgga 
ggctgaggaa gaaggtgaca gcaatgcagc 
ccttggacaa actattgaag gggaagaagt 
cattccttcg gggtgcagga tctatcgctg 
ctttggtaga ggccaagaga aagaaattaa 
aggcccaggc agctacaggt ggtataattg 
acagtgccat agctcgggac ctcattgact 
aaaaagctat cactggtttt gatgatccat 
ccatcaagaa aaatttgatt gatagagaaa 
cttcaggggg tgtagtagac cctgtgaaca 
cccgggggct gattgataga gatttgtatc 
aaaactttgt ggatccagtc accaaaaaga 
gcagaatcga accacatact ggtctgctct 
tccaaggaat cagacaacct gtgaccgtca 
cgtccactgt caatgaactg gaatctggtc 
ttaaggactt cctccagggt tcaagctgca 
agaagcttgg catttatgag gccatgaaaa 
agttgctgga agcccaagca gctactggct 
taccagtgga ggaagcctac aagagaggtc 
tgtctgcaga acgagctgtc actgggtata 
tgttccaagc catgaataag gaactcatcg 
cacagatcgc aaccgggggg atcattgacc 
tagcatataa gaggggctat ttcaatgagg 
atgataccaa aggatttttt gaccccaaca 
aagaaagatg cattaaggat gaggaaacag 
agaaacaggt gcagacatca caaaagaata 
acccagaaac caataaagaa atgtctgttc 
atgaaacctt caaagaactg tgtgagcagg 
gatcagatgg ctccaccagg gtggtcctgg 
ttcaagatgc tattgacaag ggccttgttg 
gcagcctcag cctcactcaa tttgctgaca 
gcagcagcat gggcagtggt gtcagcgatg 
taagtaagat ttccaccata tccagcgtca 
cagacaccct ggaagaatcg agccccattg 
aaatctccat tacagaaggt atagagcggg 
ttctggaggc tcaggcctgc acaggtggca 
cacttcagga cgcagtctcc cagggtgtga 
ctgctcagaa agccttcata ggcttcgagg 
cagaggcagt gaaagaaaaa tggctcccgt 
agtacctcac gggaggtctt gttgacccgg 
ccatccggaa ggggttcata gatggccgcg 
atgccaaaat cctgacctgc cccaaaacca 
atcgctccat ggtagaagat atcactgggc 



aggtcgaggc cctgaggcgg cagttactcc 5340 
tgaggaatga gcatttccag aaggcgatag 5400 
aaatagaaat tgagaggctg cagtctctca 5460 
tagaagaaga actgcggaac ctgaggctgg 5520 
aagcggacag tgataaaaat gcaaccatct 5580 
acaaccggac cctggaactg caggggctga 5640 
tgagacagga aattgagaaa ttccaaaagc 5700 
aatcaaagaa tcagtgtact caggtggtac 5760 
aagtcctgga gcaagacaag gcaaggctgc 5820 
aatcaactct agaggcagaa accagggtga 5880 
ttcagaatga cctgaatcag tggaagactc 5940 
agatagaatc ggaaagagaa aagagtgaga 6000 
aaagactcca agcagagatc aagagaattg 6060 
ctaccaggga gacacagtca cagttagaaa 6120 
ataaactcag acagcgccca tatgggtccc 6180 
ccgttgacac ctccaagctg gtgtttgatg 6240 
tctatgagtg tcagctgatc gacaaaacaa 6300 
cagtggaaga agttgcttct gaaatccagc 6360 
gagcatctgc ttctcctaag gaaaaatact 6420 
tcagcccaga atccacagtc atgcttctgg 6480 
atccccatcg gaatgagaag ctgactgtcg 6540 
tcgatgaccg tcagcagata tatgcagcag 6600 
tttcaggcaa gacagtatct gtttcagaag 6660 
ccggaatgcg cctgctggaa gcccagattg 6720 
gtgtcttttt gccaaaagat gtcgccttgg 6780 
gatccctgaa tgatccccga gatagtcaga 6840 
aggtcagtta cgtgcagctg aaggaacggt 6900 
tgctttcagt acagaagaga agcatgtcct 6960 
ctgagctagt agattctggt atattgagac 7020 
agatttctta tgacgaggtt ggtgagagaa 7080 
tagcaggcat atacaatgag accacaaaac 7140 
ttggcttagt ccgacctggt actgctctgg 7200 
ttatagtgga tcctgttagc aacttgaggt 7260 
tggtgggcat tgagttcaaa gagaagctcc 7320 
atgatcctga aacaggaaac atcatctctt 7380 
aaaagggcca cggtattcgc ttattagaag 7440 
caaaggagag ccatcgttta ccagttgaca 7500 
aactcagtga gattctctca gatccaagtg 7560 
ctgaagaaaa tcttacctat ctgcaactaa 7620 
ggctctgtct tctgcctctg aaagaaaaga 7680 
ccctcaggaa gcgtagagtg gtcatagttg 7740 
aggaggccta caagaagggc ctaattgatt 7800 
aatgtgaatg ggaagaaata accatcacgg 7860 
tagatagaaa gacaggcagt cagtatgata 7920 
acaggaagtt ctttgatcag taccgatccg 7980 
tgatctcctt gaaaaatggt gtcggcacca 8040 
atgtttttag cagctcccga catgaatcag 8100 
ggaatttaac cataaggagc agctcttttt 8160 
cagccatctt tgacacagaa aacctggaga 8220 
gcatcgttga cagcatcacg ggtcagaggc 8280 
tcatccaccc aaccacgggc cagaagctgt 8340 
ttgaccaaga catggccacc agcgtgaagc 8400 
gtgtgaaggg aaagaagaag atgtcagcag 8460 
atgaggctgg ccagcgcttc ctggagttcc 8520 
aagtgcatgg gaggataagc accgaagaag 8580 
ccgcacagag gctgcaagac accagcagct 8640 
aattaaaaat atcctataag g'atgccataa 8700 
tgcgccttct ggaagccgcc tccgtgtcgt 8760 
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ccaagggctt acccagccct tacaacatgt 
gctcgggatc tcgctccgga tctcgctccg 
"ttgacgccac agggaattct tcctactctt 
ggcactag 



cttcggctcc ggggtcccgc tccggctccc 8820 
ggtcccgcag tgggtcccgg agaggaagct 8880 
attcctactc atttagcagt agttctattg 8940 

8948 



<210> 120 
<211> 587 
' <212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 91, 131, 256, 263, 332, 392, 400, 403, 461, 496, 497, 499, 
510, 511, 518, 519, 539, 554, 560, 576 
<223> n — A, T,C or G 

<400> 120 

cgtcctaagc acttagacta catcagggaa gaacacagac cacatccctg tcctcatgcg 60 
gcttatgttt tctggaagaa agtggagacc nagtccttgg ctttagggct ccccggctgg 120 
gggctgtgca ntccggtcag ggcgggaagg gaaatgcacc gctgcatgtg aacttacagc 180 
ccaggcggat gccccttccc ttagcactac ctggcctcct gcatcccctc gcctcatgtt 240 
cctcccacct tcaaanaatg aanaacccca tgggcccagc cccttgccct ggggaaccaa 300 
ggcagccttc caaaactcag gggctgaagc anactattag ggcaggggct gactttgggt 360 
gacactgccc attccctctc agggcagctc angtcacccn ggnctcttga acccagcctg 420 
ttcctttgaa aaagggcaaa actgaaaagg gcttttccta naaaaagaaa aaccagggaa 480 
ctttgccagg gcttcnntnt taccaaaacn ncttctcnng gatttttaat tccccattng 540 
gcctccactt accnggggcn atgccccaaa attaanaatt tcccatc 587 

<210> 121 
<211> 619 
<212> DNA 

<213> Homo sapiens 

<220> ' 
<221> misc_feature 

<222> 260, 527, 560, 564, 566, 585, 599 
<223> n » A,T,C or G 

<400> 121 

cactagtagg atagaaacac tgtgtcccga gagtaaggag agaagctact attgattaga 60 
gcctaaccca ggttaactgc aagaagaggc gggatacttt cagctttcca tgtaactgta 120 
tgcataaagc caatgtagtc cagtttctaa gatcatgttc caagctaact gaatcccact 180 
tcaatacaca ctcatgaact cctgatggaa caataacagg cccaagcctg tggtatgatg 240 
tgcacacttg ctagactcan aaaaaatact actctcataa atgggtggga gtattttggt 300 
gacaacctac tttgcttggc tgagtgaagg aatgatattc atatattcat ttattccatg 360 
gacatttagt tagtgctttt tatataccag gcatgatgct gagtgacact cttgtgtata 420 
tttccaaatt tttgtacagt cgctgcacat atttgaaatc atatattaag acttccaaaa 480 
aatgaagtcc ctggtttttc atggcaactt gatcagtaaa ggattcncct ctgtttggta 540 
cttaaaacat ctactatatn gttnanatga aattcctttt ccccncctcc cgaaaaaana 600 
aagtggtggg gaaaaaaaa 619 

<210> 122 

<211> 1475 

<212> DNA 

<213> Homo sapiens 
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<400> 122 

tccacctgtc cccgcagcgc cggctcgcgc cctcctgccg cagccaccga gccgccgtct 60 
^gcgccccga cctcgccacc atgagagccc tgctggcgcg cctgcttctc tgcgtcctgg 120 
tcgtgagcga ctccaaaggc agcaatgaac ttcatcaagt tccatcgaac tgtgactgtc 180 
taaatggagg aacatgtgtg tccaacaagt acttctccaa cattcactgg tgcaactgcc 240 
caaagaaatt cggagggcag cactgtgaaa tagataagtc aaaaacctgc tatgagggga 300 
atggtcactt ttaccgagga aaggccagca ctgacaccat gggccggccc tgcctgccct 360 
ggaactctgc cactgtcctt cagcaaacgt accatgccca cagatctgat gctcttcagc 420 
tgggcctggg gaaacataat tactgcagga acccagacaa ccggaggcga ccctggtgct 480 
atgtgcaggt gggcctaaag ccgcttgtcc aagagtgcat ggtgcatgac tgcgcagatg 540 
gaaaaaagcc ctcctctcct ccagaagaat taaaatttca gtgtggccaa aagactctga 600 
ggccccgctt taagattatt gggggagaat tcaccaccat cgagaaccag ccctggtttg 660 
cggccatcta caggaggcac cgggggggct ctgtcaccta cgtgtgtgga ggcagcctca 720 
tcagcccttg ctgggtgatc agcgccacac actgcttcat tgattaccca aagaaggagg 780 
actacatcgt ctacctgggt cgctcaaggc ttaactccaa cacgcaaggg gagatgaagt 840 
ttgaggtgga aaacctcatc ctacacaagg actacagcgc tgacacgctt gctcaccaca 900 
acgacattgc cttgctgaag atccgttcca aggagggcag gtgtgcgcag ccatcccgga 960 
ctatacagac catctgcctg ccctcgatgt ataacgatcc ccagtttggc acaagctgtg 1020 
agatcactgg ctttggaaaa gagaattcta ccgactatct ctatccggag cagctgaaga 1080 
tgactgttgt gaagctgatt tcccaccggg agtgtcagca gccccactac tacggctctg 1140 
aagtcaccac caaaatgctg tgtgctgctg acccacagtg gaaaacagat tcctgccagg 1200 
gagactcagg gggacccctc gtctgttccc tccaaggccg catgactttg actggaattg 1260 
tgagctgggg ccgtggatgt gccctgaagg acaagccagg cgtctacacg agagtctcac 1320 
acttcttacc ctggatccgc agtcacacca aggaagagaa tggcctggcc ctctgagggt 1380 
ccccagggag gaaacgggca ccacccgctt tcttgctggt tgtcattttt gcagtagagt 1440 
catctccatc agctgtaaga agagactggg aagat 14 7 5 

<210> 123 

<211> 2294 

<212> DNA 

<213> Homo sapiens 

<400> 123 

cagcgccggc tcgcgccctc ctgccgcagc caccgagccg ccgtctagcg ccccgacctc 60 
gccaccatga gagccctgct ggcgcgcctg cttctctgcg* tcctggtcgt gagcgactcc 120 
aaaggcagca atgaacttca tcaagttcca tcgaactgtg actgtctaaa tggaggaaca 180 
tgtgtgtcca acaagtactt ctccaacatt cactggtgca actgcccaaa gaaattcgga 240 
gggcagcact gtgaaataga taagtcaaaa acctgctatg aggggaatgg tcacttttac 300 
cgaggaaagg ccagcactga caccatgggc cggccctgcc tgccctggaa ctctgccact 360 
gtccttcagc aaacgtacca tgcccacaga tctgatgctc ttcagctggg cctggggaaa 420 
cataattact gcaggaaccc agacaaccgg aggcgaccct ggtgctatgt gcaggtgggc 480 
ctaaagccgc ttgtccaaga gtgcatggtg catgactgcg cagatggaaa aaagccctcc 540 
tctcctccag aagaattaaa atttcagtgt ggccaaaaga ctctgaggcc ccgctttaag 600 
attattgggg gagaattcac caccatcgag aaccagccct ggtttgcggc catctacagg 660 
aggcaccggg ggggctctgt cacctacgtg tgtggaggca gcctcatcag cccttgctgg 720 
gtgatcagcg ccacacactg cttcattgat tacccaaaga aggaggacta catcgtctac 780 
ctgggtcgct caaggcttaa ctccaacacg caaggggaga tgaagtttga ggtggaaaac 840 
ctaatcctac acaaggacta cagcgctgac acgcttgctc accacaacga cattgccttg 900 
ctgaagatcc gttccaagga gggcaggtgt gcgcagccat cccggactat acagaccatc 960 
tgcctgccct cgatgtataa cgatccccag tttggcacaa gctgtgagat cactggcttt 1020 
ggaaaagaga attctaccga ctatctctat ccggagcagc tgaaaatgac tgttgtgaag 1080 
ctgatttccc accgggagtg tcagcagccc cactactacg gctctgaagt caccaccaaa 1140 
atgctgtgtg ctgctgaccc acagtggaaa acagattcct gccagggaga ctcaggggga 1200 
cccctcgtct gttccctcca aggccgcatg actttgactg gaattgtgag ctggggccgt 1260 
ggatgtgccc tgaaggacaa gccaggcgtc tacacgagag tctcacactt cttaccctgg 1320 
atccgcagtc acaccaagga agagaatggc ctggccctct gagggtcccc agggaggaaa 1380 
cgggcaccac ccgctttctt gctggttgct attttgcagt agagtcatct ccatcagctg 1440 
taagaagagc tgggaatata ggctctgcac agatggattt gcctgtgcca ccaccagggc 1500 
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gaacgacaat agctttaccc tcaggcatag gcctgggtgc tggctgccca gacccctctg 1560 
gccaggatgg aggggtggtc ctgactcaac atgttactga ccagcaactt gtctttttct 1620 
<ggactgaagc ctgcaggagt taaaaagggc agggcatctc ctgtgcatgg gctcgaaggg 1680 
agagccagct cccccgaccg gtgggcattt gtgaggccca tggttgagaa atgaataatt 1740 
tcccaattag gaagtgtaag cagctgaggt ctcttgaggg agcttagcca atgtgggagc 1800 
agcggtttgg ggagcagaga cactaacgac ttcagggcag ggctctgata ttccatgaat 1860 
gtatcaggaa atatatatgt gtgtgtatgt ttgcacactt gtgtgtgggc tgtgagtgta 1920 
agtgtgagta agagctggtg tctgattgtt aagtctaaat atttccttaa actgtgtgga 1980 
ctgtgatgcc acacagagtg gtctttctgg agaggttata ggtcactcct ggggcctctt 2040 
gggtccccca cgtgacagtg cctgggaatg tattattctg cagcatgacc tgtgaccagc 2100 
actgtctcag tttcactttc acatagatgt ccctttcttg gccagttatc ccttcctttt 2160 
agcctagttc atccaatcct cactgggtgg ggtgaggacc actcctgtac actgaatatt 2220 
tatatttcac tatttttatt tatatttttg taattttaaa taaaagtgat caataaaatq 2280 
tgatttttct gatg y 2294 

<210> 124 
<211> 956 
<212> DNA 
<213> Homo sapiens 

<400> 124 

gatgagttcc gcaccaagtt tgagacagac caggccctgc gcctgagtgt ggaggccgac 60 
atcaatggcc tgcgcagggt gctggatgag ctgaccctgg ccagagccga cctggagatg 120 
cagattgaga acctcaagga ggagctggcc tacctgaaga agaaccacga ggaggagatg 180 
aacgccctgc gaggccaggt gggtggtgag atcaatgtgg agatggacgc tgccccaggc 240 
gtggacctga gccgcatcct caacgagatg cgtgaccagt atgagaagat ggcagagaag 300 
aaccgcaagg atgccgagga ttggttcttc agcaagacag aggaactgaa ccgcgaggtg 360 
gccaccaaca gtgagctggt gcagagtggc aagagtgaga tctcggagct ccggcgcacc 420 
atgcaggcct tggagataga gctgcagtcc cagctcagca tgaaagcatc cctggagggc 480 
aacctggcgg agacagagaa ccgctactgc gtgcagctgt cccagatcca ggggctgatt 540 
ggcagcgtgg aggagcagct ggcccagctt cgctgcgaga tggagcagca gaaccaggaa 600 
tacaaaatcc tgctggatgt gaagacgcgg ctggagcagg agattgccac ctaccgccgc 660 
ctgctggagg gagaggatgc ccacctgact cagtacaaga aagaaccggt gaccacccgt 720 
caggtgcgta ccattgtgga agaggtccag gatggcaagg tcatctcctc ccgcgagcag 780 
gtccaccaga ccacccgctg aggactcagc taccccggcc ggccacccag gaggcaggga 840 
cgcagccgcc ccatctgccc cacagtctcc ggcctctcca gcctcagccc cctgcttcag 900 
tcccttcccc atgcttcctt gcctgatgac aataaaagct tgttgactca gctatg 956 

<210> 125 
<211> 486 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 16 

<223> n = A, T, C or G 



60 



<400> 125 

aaattatata tagtgnttca gctcccattg tggtgttcat agtcttctag gaacagataa 
acttaagtat tcaattcact cttggcattt tttctttaat ataggctttt tagcctattt 120 
ttggaaaact gcttttcttc tgagaacctt attctgaatg tcatcaactt taccaaacct 180 
tctaagtcca gagctaactt agtactgttt aagttactat. tgactgaatt ttcttcattt 240 
tctgtttagc cagtgttacc.aaggtaagct ggggaatgaa gtataccaac ttctttcaga 300 
gcattttagg acattatggc agctttagaa ggctgtcttg tttctagcca agggagagcc 360 
agcgcaggtt ttggatacta gagaaagtca tttgcttgta ctattgccat tttagaaagc 420 
tctgatgtga attcaaattt tacctctgtt acttaaagcc aacaatttta aggcagtagt 480 
tttaCt 486 
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<210> 126 

<211> 3552 

<212> DNA 

<213> Homo sapiens 

<400> 126 

cggcaggcag gtctcgtctc ggcaccctcc 
catcccgatg gccgccgctg ggccccggcg 
gctgctgacc ctcgtgatct tcagtcgtgc 
tgtaccttct aaactagagg cagacaaaat 
caggtctgca gacctcatcc ggtcaagtga 
agtgtacaca gccagggctg ttgcgctgtc 
ttctgacaaa aggaaacaga cacagaaaga 
ggtatcgaag acaagacaca ctagagaaac 
acctattcct tgctctatgc aagagaattc 
agttgaatct gatgcagcac agaactatac 
tgataaagaa cctttaaatt tgttttatat 
tcggcctgtg gatcgtgaag aatatgatgt 
agatggatat tcagcagatc tgcccctccc 
caaccaccct gttttcacag aagcaattta 
tggtactaca gtgggggtgg tttgtgccac 
gcgcctgaaa tacagcattt tgcagcagac 
tcccagcaca ggcgtaatca ccacagtctc 
gtactcattg ataatgaaag tacaagacat 
atcaacttgt atcataacag taacagattc 
tgcttatgaa gcatttgtag aggaaaatgc 
agaagataag gatttaatta acactgccaa 
aaatgaaaat ggacatttca aaatcagcac 
tgttgtaaag ccactgaatt atgaagaaaa 
caatgaagcg ccatttgcta gagatattcc 
tacagttcat gtgagggatc tggatgaggg 
gcggattaaa gaaaacttag cagtggggtc 
cgaaaataga aatggcaatg gtttaaggta 
caccattgat gaaatttcag ggtcaatcat 
aactcccaaa aatgagttgt ataatattac 
atgtactgga acacttgctg tgaacattga 
tcaagaatat gtagtcattt gcaaaccaaa 
tcctgatgaa cctgtccatg gagctccatt 
aatcagtaga ctgtggagcc tcaccaaagt 
gaaaaatgct ggatttcaag aatataccat 
agctgcaaca aaattattga gagttaatct 
tgcgacttca aggagtacag gagtaatact 
gggtatagca ctgctctttt ctgtattgct 
taaagggaaa cgttttcctg aagatttagc 
agcacctgga gacgatagag tgtgctctgc 
ctctagccaa ggtttttgtg gtactatggg 
cattgaaatg atgaaaggag gaaaccagac 
tcataccctg gactcctgca ggggaggaca 
ctcggagtgg cacagtttta ctcaaccccg 
gaatgaagac cgcatgccat cccaagatta 
atctccagct ggttctgtgg gctgctgcag 
tttaaataat ttggaaqcca aatttattac 
tcacagtgct acaattaggt ctttgtcaga 
aaagttcaat ttcaacatgt atgtatatga 
ctcaccaatt tatattttta aagcaagttg 
ttaaaacaga caactggtaa atctcaaact 
ctgctctttt ttttttttac agatatttta 



cggcgcccgc gttctcctgg ccctgcccgg 60 
ctccgtgcgc ggagccgtct gcctgcatct 120 
tggtgaagcc tgcaaaaagg tgatacttaa 180 
aattggcaga gttaatttgg aagagtgctt 240 
tcctgatttc agagttctaa atgatgggtc 300 
tgataagaaa agatcattta ccatatggct 360 
ggttactgtg ctgctagaac atcagaagaa 420 
tgttctcagg cgtgccaaga ggagatgggc 480 
cttgggccct ttcccattgt ttcttcaaca 540 
tgtcttctac tcaataagtg gacgtggagt 600 
agaaagagac actggaaatc tattttgcac 660 
ttttgatttg attgcttatg cgtcaactgc 720 
actacccatc agggtagagg atgaaaatga 780 
taattttgaa gttttggaaa gtagtagacc 840 
agacagagat gaaccggaca caatgcatac 900 
accaaggtca cctgggctct tttctgtgca 960 
tcattatttg gacagagagg ttgtagacaa 1020 
ggatggccag ttttttggat tgataggcac 1080 
aaatgataat gcacccactt tcagacaaaa 1140 
attcaatgtg gaaatcttac gaatacctat 1200 
ttggagagtc aattttacca ttttaaaggg 1260 
agacaaagaa actaatgaag gtgttctttc 1320 
ccgtcaagtg aacctggaaa ttggagtaaa 1380 
cagagtgaca gccttgaaca gagccttggt 1440 
gcctgaatgc actcctgcag cccaatatgt 1500 
aaagatcaac ggctataagg catatgaccc 1560 
caaaaaattg catgatccta aaggttggat 1620 
aacttccaaa atcctggata gggaggttga 1680 
agtcctggca atagacaaag atgatagatc 1740 
agatgtaaat gataatccac cagaaatact 1800 
aatggggtat accgacattt tagctgttga 1860 
ttatttcagt ttgcccaata cttctccaga 1920 
taatgataca gctgcccgtc tttcatatca 1980 
tcctattact gtaaaagaca gggccggcca 2040 
gtgtgaatgt actcatccaa ctcagtgtcg 2100 
tggaaaatgg gcaatccttg caatattact 2160 
aactttagta tgtggagttt ttggtgcaac 2220 
acagcaaaac ttaattatat caaacacaga 2280 
caatggattt atgacccaaa ctaccaacaa, 2340 
atcaggaatg aaaaatggag ggcaggaaac 2400 
cttggaatcc tgccgggggg ctgggcatca 2460 
cacggaggtg gacaactgca gatacactta 2520 
tctcggtgaa aaattgcatc gatgtaatca 2580 
tgtcctcact tataactatg agggaagagg 2640 
tgaaaagcag gaagaagatg gccttgactt 2700 
attagcagaa gcatgcacaa agagataatg 2760 
cattctggag gtttccaaaa ataatattgt 2820 
tgattttttt ctcaattttg aattatgcta 2880 
ttgcttatct tttccaaaaa gtgaaaaatg 2940 
ccagcactgg aattaaggtc tctaaagcat 3000 
gtaataaata tgctggataa atattagtcc 3060 
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aacaatagct aagttatgct aatatcacat tattatgtat tcactttaag tgatagttta 3120 
aaaaataaac aagaaatatt gagtatcact atgtgaagaa agttttggaa aagaaacaat 3180 
"gaagactgaa ttaaattaaa aatgttgcag ctcataaaga attggactca cccctactgc 3240 
actaccaaat tcatttgact ttggaggcaa aatgtgttga agtgccctat gaagtagcaa 3300 
ttttctatag gaatatagtt ggaaataaat gtgtgtgtgt atattattat taatcaatgc 3360 
aatatttaaa tgaaatgaga acaaagagga aaatggtaaa aacttgaaat gaggctgggg 3420 
tatagtttgt cctacaatag aaaaaagaga gagcttccta ggcctgggct cttaaatgct 3480 
gcattataac tgagtctatg aggaaatagt tcctgtccaa tttgtgtaat ttgtttaaaa 3540 
ttgtaaataa at 35 52 

<210> 127 
<211> 754 
<212> DNA 
<213> Homo sapiens 

<400> 127 

tttttttttt ttgtcattgt tcattgattt taatgagaaa gctaagagag gaaataagta 60 
gcctttcaaa ggtcacacag aagtaagtga cagatccagg attcatatcc aagcattctg 120 
gctctagtgt ccatgcttct caaccattat gacccaatat tcaaccaaat caatactgaa 180 
ggacacgtga aatgtatccg gtattttact attacaaaca aaaatccaat gaacattctt 240 
gaagacatac acaaaaataa tggttacaat agaagttact ggaattgaaa ttttggttca 300 
acctatatta aaatgtaagg cttttgatat agctaataga tttttgaaat gatcagtctt 360 
aacgtttgta ggggagcaca ctcctgcatg gggaaaagat tcactgtgaa gcacagagca 420 
cctttatggt tggatcatct tgtcattaaa gttcaggcgt tatctatcct gtaagtggca 480 
gaatcaagac tgcaatatcg cctgcttttc tttttaactc atgttttccc ttgactacac 540 
tggtcctcaa agtaaaaccc ctgtgtcagt gtactattca tggaatactc tgcaattata 600 
accaccttct aatactttta atacccaatc aaaatttatt atacatatgt atcatagata 660 
ctcatctgta aagctgtgct tcaaaatagt gatctcttcc caacattaca atatatatta 720 
atgatgtcga acctgcccgg gcggccgctc gaag 754 

<210> 128 
<211> 374 
<212> DNA 
<213> Homo sapiens 

<400> 128 

aggttttgat taaaaaggca aatgatttta ttgttcgata atcttttaaa aaaataagag 60 

gaaggagtaa aattaaagat gaaagatgat ttttatttcc ttgtgacctc tatatccccc 120 

ttcccctgcc cttggtaagt aactcttgat ggagaaagga ttaaagactc ttatttaacc 180 

aaaaaacaga gccagctaat catttccaaa ggttagtatc tccctgctga cctcttcttt 240 

ggtttaattg aataaaacta tatgttcata tatgtattaa aacaactcag aataacatct 300 

tttcttcctt agttaaggca ttataagggc tatactatca tccataataa ccaaggcaat 360 
aacttaaaaa gctg 374 

<210> 129 
<211> 546 
<212> DNA 

<213> Homo sapiens 
<400> 129 

agtgtgatgg atatctgcag aattcgggct aagcgtggtc gcggcccgag gtctggaact 60 
tcccagcacy tgaaaaggag cctcctgagc tgactcggct aaagccccac tttcgctcct 120 
cctcatttct gcctactgat ttccttggag cattcatctg aatattaccg tttgctgtgt 180 
aacctggtac atacatagca tgactccctg gaatagagtg ggctggggtg cttatgctgg 240 
gagagtgatt gacatgcact ttcaagctat atctaccatt tgcagcaaag gagaaaaaat 300 
acctcgagta aattccatca ttttttataa catcagcacc tgctccatca tcaaggagtc 360 
tcagcgtaac aggatctcca gtctctggct caactgtggc agtgacagtg gcattaagaa 420 
tgggataaaa tccctgtttc acattggcat aaatcatcac aggatgagga aaatggaggc 480 
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tgtctctttc cacaaaggct tccacagtgg ctgggggcac agacctgccc gggcggccgc 540 
tcgaaa 546 

<210> 130 
<211> 5156 
<212> DNA 

<213> Homo sapiens 
<400> 130 

accaaccgag gcgccgggca gcgacccctg cagcggagac agagactgag cggcccggca 60 
ccgccatgcc tgcgctctgg ctgggctgct gcctctgctt gtcgctcctc ctgcccgcag 120 
cccgggccac ctccaggagg gaagtctgtg attgcaatgg gaagtccagg cagtgtatct 180 
ttgatcggga acttcacaga caaactggta atggattccg ctgcctcaac tgcaatgaca 240 
acactgatgg cattcactgc gagaagtgca agaatggctt ttaccggcac agagaaaggg 300 
accgctgttt gccctgcaat tgtaactcca aaggttctct tagtgctcga tgtgacaact 360 
ccggacggtg cagctgtaaa ccaggtgtga caggagccag atgcgaccga tgtctgccag 420 
gcttccacat gctcacggat gcggggtgca cccaagacca gagactgcta gactccaagt 480 
gtgactgtga cccagctggc atcgcagggc cctgtgacgc gggccgctgt gtctgcaagc 540 
cagctgtcac tggagaacgc tgtgataggt gtcgatcagg ttactataat ctggatgggg 600 
ggaaccctga gggctgtacc cagtgtttct gctatgggca ttcagccagc tgccgcagct 660 
ctgcagaata cagtgtccat aagatcacct ctacctttca tcaagatgtt gatggctgga 720 
aggctgtcca acgaaatggg tctcctgcaa agctccaatg gtcacagcgc catcaagatg 780 
tgtttagctc agcccaacga ctagaccctg tctattttgt ggctcctgcc aaatttcttg 840 
ggaatcaaca ggtgagctat ggtcaaagcc tgtcctttga ctaccgtgtg gacagaggag 900 
gcagacaccc atctgcccat gatgtgattc tggaaggtgc tggtctacgg atcacagctc 960 
ccttgatgcc acttggcaag acactgcctt gtgggctcac caagacttac acattcaggt 1020 
taaatgagca tccaagcaat aattggagcc cccagctgag ttactttgag tatcgaaggt 1080 
tactgcggaa tctcacagcc ctccgcatcc gagctacata tggagaatac agtactgggt 1140 
acattgacaa tgtgaccctg atttcagccc gccctgtctc tggagcccca gcaccctggg 1200 
ttgaacagtg tatatgtcct gttgggtaca aggggcaatt ctgccaggat tgtgcttctg 1260 
gctacaagag agattcagcg agactggggc cttttggcac ctgtattcct tgtaactgtc 1320 
aagggggagg ggcctgtgat ccagacacag gagattgtta ttcaggggat gagaatcctg 1380 
acattgagtg tgctgactgc ccaattggtt tctacaacga tccgcacgac ccccgcagct 1440 
gcaagccatg tccctgtcat aacgggttca gctgctcagt gatgccggag acggaggagg 1500 
tggtgtgcaa taactgccct cccggggtca ccggtgcccg ctgtgagctc tgtgctgatg 1560 
gctactttgg ggaccccttt ggtgaacatg gcccagtgag gccttgtcag ccctgtcaat 1620 
gcaacaacaa tgtggacccc agtgcctctg ggaattgtga ccggctgaca ggcaggtgtt 1680 
tgaagtgtat ccacaacaca gccggcatct actgcgacca gtgcaaagca ggctacttcg 1740 
gggacccatt ggctcccaac ccagcagaca agtgtcgagc ttgcaactgt aaccccatgg 1800 
gctcagagcc tgtaggatgt cgaagtgatg gcacctgtgt ttgcaagcca ggatttggtg 1860 
gccccaactg tgagcatgga gcattcagct gtccagcttg ctataatcaa gtgaagattc 1920 
agatggatca gtttatgcag cagcttcaga gaatggaggc cctgatttca aaggctcagg 1980 
gtggtgatgg agtagtacct gatacagagc tggaaggcag gatgcagcag gctgagcagg 2040 
cccttcagga cattctgaga gatgcccaga tttcagaagg tgctagcaga tcccttggtc 2100 
tccagttggc caaggtgagg agccaagaga acagctacca gagccgcctg gatgacctca 2160 
agatgactgt ggaaagagtt cgggctctgg gaagtcagta ccagaaccga gttcgggata 2220 
ctcacaggct catcactcag atgcagctga gcctggcaga aagtgaagct tccttgggaa 2280 
acactaacat tcctgcctca gaccactacg tggggccaaa tggctttaaa agtctggctc 2340 
aggaggccac aagattagca gaaagccacg ttgagtcagc cagtaacatg gagcaactga 2400 
caagggaaac tgaggactat tccaaacaag ccctctcact ggtgcgcaag gccctgcatg 2460 
aaggagtcgg aagcggaagc ggtagcccgg acggtgctgt ggtgcaaggg Qttgtggaaa 2520 
aattggagaa aaccaagtcc ctggcccagc agttgacaag ggaggccact caagcggaaa 2580 
ttgaagcaga taggtcttat cagcacagtc tccgcctcct ggattcagtg tctcggcttc 2640 
agggagtcag tgatcagtcc tttcaggtgg aagaagcaaa gaggatcaaa caaaaagcgg 2700 
attcactctc aagcctggta accaggcata tggatgagtt caagcgtaca cagaagaatc 2760 
tgggaaactg gaaagaagaa gcacagcagc tcttacagaa tggaaaaagt gggagagaga 2820 
aatcagatca gctgctttcc cgtgccaatc ttgctaaaag cagagcacaa gaagcactga 2880 
gtatgggcaa tgccactttt tatgaagttg agagcatcct taaaaacctc agagagtttg 2940 
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acctgcaggt ggacaacaga aaagcagaag ctgaagaagc catgaagaga ctctcctaca 3000 
tcagccagaa ggtttcagat g.ccagtgaca agacccagca agcagaaaga gccctgggga 3060 
- gcgctgctgc tgatgcacag agggcaaaga atggggccgg ggaggccctg gaaatctcca 3120 
gtgagattga acaggagatt gggagtctga acttggaagc caatgtgaca gcagatggag 3180 
ccttggccat ggaaaaggga ctggcctctc tgaagagtga gatgagggaa gtggaaggag 3240 
agctggaaag gaaggagctg gagtttgaca cgaatatgga tgcagtacag atggtgatta 3300 
cagaagccca gaaggttgat accagagcca agaacgctgg ggttacaatc caagacacac 3360 
tcaacacatt agacggcctc ctgcatctga tggaccagcc tctcagtgta gatgaagagg 3420 
ggctggtctt actggagcag aagctttccc gagccaagac ccagatcaac agccaactgc 3480 
ggcccatgat gtcagagctg gaagagaggg cacgtcagca gaggggccac ctccatttgc 3540 
tggagacaag catagatggg attctggctg atgtgaagaa cttggagaac attagggaca 3600 
acctgccccc aggctgctac aatacccagg ctcttgagca acagtgaagc tgccataaat 3660 
atttctcaac tgaggttctt gggatacaga tctcagggct cgggagccat gtcatgtgag 3720 
tgggtgggat ggggacattt gaacatgttt aatgggtatg ctcaggtcaa ctgacctgac 3780 
cccattcctg atcccatggc caggtggttg tcttattgca ccatactcct tgcttcctga 3840 
tgctgggcaa tgaggcagat agcactgggt gtgagaatga tcaaggatct ggaccccaaa 3900 
gaatagactg gatggaaaga caaactgcac aggcagatgt ttgcctcata atagtcgtaa 3960 
gtggagtcct ggaatttgga caagtgctgt tgggatatag tcaacttatt ctttgagtaa 4020 
tgtgactaaa ggaaaaaact ttgactttgc ccaggcatga aattcttcct aatgtcagaa 4080 
cagagtgcaa cccagtcaca ctgtggccag taaaatacta ttgcctcata ttgtcctctg 4140 
caagcttctt gctgatcaga gttcctccta cttacaaccc agggtgtgaa catgttctcc 4200 
attttcaagc tggaagaagt gagcagtgtt ggagtgagga cctgtaaggc aggcccattc 4260 
agagctatgg tgcttgctgg tgcctgccac ettcaagttc tggacctggg catgacatcc 4320 
tttcttttaa tgatgccatg gcaacttaga gattgcattt ttattaaagc atttcctacc 4380 
agcaaagcaa atgttgggaa agtatttact ttttcggttt caaagtgata gaaaagtgtg 4440 
gcttgggcat tgaaagaggt aaaattctct agatttatta gtcctaattc aatcctactt 4500 
ttagaacacc aaaaatgatg cgcatcaatg tattttatct tattttctca atctcctctc 4560 
tctttcctcc acccataata agagaatgtt cctactcaca cttcagctgg gtcacatcca 4620 
tccctccatt catccttcca tccatctttc catccattac ctccatccat ccttccaaca 4 680 
tatatttatt gagtacctac tgtgtgccag gggctggtgg gacagtggtg acatagtctc 4740 
tgccctcata gagttgattg tctagtgagg aagacaagca tttttaaaaa ataaatttaa 4800 
acttacaaac tttgtttgtc acaagtggtg tttattgcaa taaccgcttg gtttgcaacc 4860 
tctttgctca acagaacata tgttgcaaga ccctcccatg ggggcacttg agttttggca 4920 
aggctgacag agctctgggt tgtgcacatt tctttgcatt ccagctgtca ctctgtgcct 4980 
ttctacaact gattgcaaca gactgttgag ttatgataac accagtggga attgctggag 5040 
gaaccagagg cacttccacc ttggctggga agactatggt gctgccttgc ttctgtattt 5100 
ccttggattt tcctgaaagt gtttttaaat aaagaacaat tgttagaaaa aaaaaa 5156 



<210> 131 
<211> 671 
<212> DNA 
<213> Homo sapiens 

<400> 131 

aggtctggag ggcccacagc cggatgtggg 
ttttgcatcc cggttgcagt gtgttgcaga 
cctgggcagc caycacgagg atcatgactc 
tcccgatgct ggtggagtgt ttgttgacac 
ccattgcgct ggtttatccc tgagtcctgt 
agaatgaggg caagatccct ctgcgagggt 
tgcctagaag ccaatgggtg cacagtgatg 
aggatgtcgc ctggaatatt caaattgaat 
gaatttttct ttcgccatac agaaattgtt 
cctgaccctt cctgctcccc aggaagggag 
cgtgacaccg gagacaggtc ttcttcaccg 
ttaactgcta t 



acaccgggaa aaagtggtca tagcacacat 60 
cgaagtcctc ttgctcgtca ccccacactt 120 
ggaaaataaa gatgactgtg atccacacct 180 
ccccgatgaa agtgtgcagc gtcccccaat 240 
ttccaacgac tgccagtgtt tcagacccaa 300 
ttcagacctc cttctcctac cccactggag 360 
atacgaatgt caatctttgc tcggtcagtg 420 
tacagatgca tgaagagggc gtacaagtta 480 
tagccagatc ttctgtactt cttttccttc 540 
gtcagccccg tttgcaaaac acaggatgcc 600 
acaggaagtg ccttctggtg cctgcacgtt 660 

671 
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<210> 132 
<<211> 590 
<212> DMA 

<213> tlomo sapiens 
<400> 132 

ctgaatggaa aagcttatgg ctctgtgatg 
cttggcaatt gcttacccac tgtgctcagc 
f ggttcatctg cagccccaaa tctggaggaa 
tttgttccag atatatcaaa ctccaatagc 
ggaactggag acattttcca gcaacatatt 
cctcaccatc aattgaaaaa cacagtgact 
tttctagtta cgtggcaggc cagtggtcct 
cgaaaatact acacaaataa ttttatcacc 
attccaggaa cagctaagcc tgggcactgg 
ctgcaagccc tgaaagtgac agtgacctct 

<210> 133 
<211> 581 
<212> DNA 
<213> Homo sapiens 



atattagtga ccagcggaga tgataagctt 60 
agtggttcaa caattcactc cattgccctg 120 
ttatcacgtc ttacaggagg tttaaagttc 180 
atgattgatg ctttcagtag aatttcctct 240 
cagcttgaaa gtacaggtga aaatgtcaaa 300 
gtggataata ctgtgggcaa cgacactatg 360 
cctgagatta tattatttga tcctgatgga 420 
aatctaactt ttcggacagc tagtctttgg 480 
acttacaccc tgaacaatac ccatcattct 540 
cgcgcctcca actcagacct 590 



<400> 133 

aggtcctgtc cgggggcact gagaactccc 
actgtgggcc tggagataaa acttgtctcc 
ctgtcctcat ctctgcaaag ttcagcttcc 
atgctctggg gagctcatgg gtggaggagt 
tgggccaggg atgaatattt gagggataaa 
gggggagaac agagaggagc tgggctatgg 
tggctggata tctggtacta aaaaagggtc 
aatccaaacc atagctgtct gtccagtgct 
cctcctagac tctgtccctg ggctagggca 
ggctgaggag agtgtgacat gtggggagag 

<210> 134 

<211> 4797 

<212> DNA 

<213> Homo sapiens 



tctggaattc ttggggggtg ttggggagag 60 
tctaccacca ccctgtaccc tagcctgcac 120 
ttccccaggt ctctgtgcac tctgtcttgg 180 
ctccaccaga gggaggctca ggggactggt 240 
aattgtgtaa gagccaaaga attggtagta 300 
gaaatgattt gaataatgga gctgggaata 360 
tttaagaacc tacttcctaa tctcttcccc 420 
ctcttcctgc ctccagctct gccccaggct 480 
ggggaggagg gagagcaggg ttgggggaga 540 
gaccagacct c 581 



<220> 

<221> misc_feature 

<222> 135, 501, 4421, 4467, 4468, 

<223> n = A,T,C or G 

<400> 134 

cctgggacca aagtgctgcc cagagctgag 
ctgtgtacct gtgcagcaca gggtagggtg 
ggagcagcag agacncgcca agcctttact 
ttgtggctac taatttgccc cctgaagatc 
tctccggctc aggtgcaggt gaggttgtca 
ggtcatgcct gggggcagtg gtcaggcagt 
caccctgcct gccctgtctc cacccagctg 
ggggtcgtga gctgctgtta aggagagctc 
tccagaaggc cccagggcgc nctgctgcac 
ctcactaaga aacctctgga acccccttca 
tctcatctgc aaaatgggaa taataccttg 
cacagagcca gctggggtgt agctcttcca 



4 698 

\ 

ggtcctggag ccacatgaga aggcttctcc 60 
agtccactca gctgtctagg agaggaccca 120 
cataccatat tctgatcctt ttccagcaaa 180 
aagatggctc tggggatgac tctgacaact 240 
tgggggcccc ccccacccaa gacggcaaca 300 
ctcctgtgtt tactgagcat gtactgagtg 360 
gctccaaagg gcaatgctga ggagaggaat 420 
atgcttggag gtgaggtgaa ggctgtgagc 480 
gcaggctcat attcactagg aatagcttta 540 
gaaggttatt tgactcctga gcctctattt 600 
acctgataag cttgtggagc tgtaaggcag 660 
tccaagctcc cttccttact tcccctttcc 720 
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tgtggggact gggggagaga agtccctgag ctggaggtgg tcagggaagc ttcacagagg 780 
aggtggctct tgagtggacc tcaggaagag gggtgagaga gctaaggaag gaggctgagg 840 
^tcatccctgg ggaagtgacc tagcggaggc ctgagagctg caaggtagga tatctgttgt 900 
tggaagtgtc tgttgttgga agtgggggcc tttttttcag ggagggtggg gccagagaag 960 
tgtgtgccct gggataagta ggataaccac agtagttatg cccctaaggg atgcccaccc 1020 
cacccctgtg gtcacagaaa agctttccca ggtggcctag gcacctgtct cgtggctcca 1080 
gagacaggct gcacctgaca cacacaatgg aaggacagct ctccttgtcc attttccaag 1140 
gagcttagcc tcagctgcct tgtccaggta ctagcctccc tcatagcctg agcttggcca 1200 
gcccaggtgc tctggagcct cccccgaccc acccaacaca ctctgcttct ggtcctcccc 1260 
accccccacc tccccaacac actctgcttc tggtcctgca , ggtgctttgc aagatatcac 1320 
cttgtcacag cagaccccct ccacttggaa ggacacgcag ctcctgacgg ctattcccac 1380 
gtctccagaa cccaccggcc tggaggctac agctgcctcc acctccaccc tgccggctgg 1440 
agaggggccc aaggagggag aggctgtagt cctgccagaa gtggagcctg gcctcaccgc 1500 
ccgggagcag gaggccaccc cccgacccag ggagaccaca cagctcccga ccactcatca 1560 
ggcctcaacg accacagcca ccacggccca ggagcccgcc acctcccacc cccacaggga 1620 
catgcagcct ggccaccatg agacctcaac ccctgcagga cccagccaag ctgaccttca 1680 
cactccccac acagaggatg gaggtccttc tgccaccgag agggctgctg aggatggagc 1740 
ctccagtcag ctcccagcag cagagggctc tggggagcag gtgagtggcc tctgcattcc 1800 
ttgggaaatt gagtgggttg gtcctaatgc ctggcacttg gcaggcccta cacctgtgcc 1860 
ctgcgcgatc tcgtattcct caccaggaag acagggcaca ggggccgcct tcccctaccc 1920 
ccagggcctc gcagagcagg acagactaac tatgagatca gagcagaagc acccttaaag 1980 
atcacccaag agagggctcc caaactcaca atccaaactt gcagccctcg tcgaagagtg 2040 
aacgttatac cagtcatttt atttatagct tcgtggattt acgcttacac taaatagtct 2100 
gctattcata caaaatgtgt gctttgtatc actttttgtg atatccatgc catggtccag 2160 
ccagggtccg gagttgatgt ggcaagaagg cctggctttc gggccctgtg cgatcctggt 2220 
ttgggtgcat ctgagtgggt ggtggcaaag atcagggagg caggagctgc ttctgggtct 2280 
gtagtggagc tggttgctgc tgctggcggt gacctggcca acccaatctg cccctgccct 2340 
cccacaggac ttcacctttg aaacctcggg ggagaatacg gctgtagtgg ccgtggagcc 2400 
tgaccgccgg aaccagtccc cagtggatca gggggccacg ggggcctcac agggcctcct 2460 
ggacaggaaa gaggtgctgg gaggtgagtt ttctttcagg ggggtagttt ggggtgaatt 2520 
gctgctgtgg ggtcagggtg gggctgacca cagccaaggc cactgctttg ggagggtctg 2580 
cacgagagcc caaggagccg ctgagctgag ctggccccgt ctacctgccc taggggtcat 2640 
tgccggaggc ctcgtggggc tcatctttgc tgtgtgcctg gtgggtttca tgctgtaccg 2700 
catgaagaag aaggacgaag gcagctactc cttggaggag ccgaaacaag ccaacggcgg 2760 
ggcctaccag aagcccacca aacaggagga attctatgcq tgacgcggga gccatgcgcc 2820 
ccctccgccc tgccactcac taggccccca cttgcctctt ccttgaagaa ctgcaggccc 2880 
tggcctcccc tgccaccagg ccacctcccc agcattccag cccctctggt cgctcctgcc 2940 
cacggagtcg tgggtgtgct gggagctcca ctctgcttct ctgacttctg cctggagact 3000 
tagggcacca ggggtttctc gcataggacc tttccaccac agccagcacc tggcatcgca 3060 
ccattctgac tcggtttctc caaactgaag cagcctctcc ccaggtccag ctctggaggg 3120 
gagggggatc cgactgcttt ggacctaaat ggcctcatgt ggctggaaga tcctgcgggt 3180 
ggggcttggg gctcacacac ctgtagcact tactggtagg accaagcatc ttgggggggt 3240 
ggccgctgag tggcagggga caggagtcac tttgtttcgt ggggaggtct aatctagata 3300 
tcgacttgtt tttgcacatg tttcctctag ttctttgttc atagcccagt agaccttgtt 3360 
acttctgagg taagttaagt aagttgattc. ggtatccccc catcttgctt ccctaatcta 3420 
tggtcgggag acagcatcag ggttaagaag actttttttt ttttttttaa actaggagaa 3480 
ccaaatctgg aagccaaaat gtaggcttag tttgtgtgtt gtctcttgag tttgtcgctc 3540 
atgtgtgcaa cagggtatgg actatctgtc tggtggcccc gttctggtgg tctgttggca 3600 
ggctggccag tccaggctgc cgtggggccg ccgcctcttt caagcagtcg tgcctgtgtc 3660 
catgcgctca gggccatgct gaggcctggg ccgctgccac gttggagaag cccgtgtgag 3720 
aagtgaatgc tgggactcag ccttcagaca gagaggactg tagggagggc gpcaggggcc 3780 
tggagatcct cctgcaggct cacgcccgtc ctcctgtggc gccgtctcca ggggctgctt 3840 
cctcctggaa attgacgagg ggtgtcttgg gcagagctgg ctctgagcgc ctccatccaa 3900 
ggccaggttc tccgttagct cctgtggccc caccctgggc cctgggctgg aatcaggaat 3960 
attttccaaa gagtgatagt cttttgcttt tggcaaaact ctacttaatc caatgggttt 4020 
ttccctgtac agtagatttt ccaaatgtaa taaactttaa tataaagtag tctgtgaatg 4080 
ccactgcctt cgcttcttgc ctctgtgctg tgtgtgacgt gaccggactt ttctgcaaac 4140 
accaacatgt tgggaaactt ggctcgaatc tctgtgcctt cgtctttccc atggggaggg 4200 
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attctggttc cagggtccct ctgtgtattt 
aggtcggtag gttcagccaa ggttttataa 
ccaagcccat cttctaaatg gcaaaggaag 
gctgtggtca cagcggaggt gtggagccga 
cctcccaccc ggctgcagag gccaganncc 
tgacaacgtt tcagcgactc cgttggccac 
atgcaccacc aagccaaggg aacctgtgtc 
agtgtatgac tgcacatgac tcgggggtgg 
ctggtccgtg ggacggtncc caagccagag 
gtacttgtca tttcgggcaa cggctgctgt 

<210> 135 
<211> 2856 
<212> DNA 

<213> Homo sapiens 
<400> 135 

tagtcgcggg tccccgagtg agcacgccag 
gtcagagtcg cagtgggagt ccccggaccg 
cgcacgcccg tcgccacccg cgtacccggc 
tggagcccag cagcaagaag ctgacgggtc 
ttggctccct gcagtttggc tacaacactg 
aggagttcta caaccagaca tgggtccacc 
tcaccacgct ctggtccctc tcagtggcca 
tctctgtggg ccttttcgtt aaccgctttg 
tgctggcctt cgtgtccgcc gtgctcatgg 
tgctgatcct gggccgcttc atcatcggtg 
ccatgtatgt gggtgaagtg tcacccacag 
agctgggcat cgtcgtcggc atcctcatcg 
gcaacaagga cctgtggccc ctgctgctga 
gcatcgtgct gcccttctgc cccgagagtc 
agaaccgggc caagagtgtg ctaaagaagc 
tgcaggagat gaaggaagag agtcggcaga 
agctgttccg ctcccccgcc taccgccagc 
cccagcagct gtctggcatc aacgctgtct 
cgggggtgca gcagcctgtg tatgccacca 
ctgtcgtgtc gctgtttgtg gtggagcgag 
tcgctggcat ggcgggttgt gccatactca 
taccctggat gtcctatctg agcatcgtgg 
tgggtcctgg ccccatccca tggttcatcg 
cagctgccat tgccgttgca ggcttctcca 
gcttccagta tgtggagcaa ctgtgtggtc 
tggttctgtt cttcatcttc acctacttca 
atgagatcgc ttccggcttc cggcaggggg 
agctgttcca tcccctgggg gctgattccc 
gcctgctccc agcagcccta aggatctctc 
aacctgacag atgtcagccg agccgggcct 
ccagaagaat attcaggact taacggctcc 
aaatctattc agacaagcaa caggttttat 
atatcagcct gagtctcctg tgcccacatc 
gagggtggag actaagccct gtcgagacac 
ctggacctat gtcctaagga cacactaatc 
gaggtggcta tggccacccg ttctgctggc 
cattaggatt tgccccttcc catctcttcc 
cctgagacca gttgggagca ctggagtgca 
gccgggttct agtctccttt gcactgaggg 
gggagcctgc aaactcactg ctcaagaaga 
tgcaagatat ttatatatat ttttggttgt 



gcttttttgt tttggctgaa attctcctgg 4260 
ggctgatgtc aatttctgtg ttgccaagct 4320 
gtggatggcc ccagcacagc ttgacctgag 4380 
ggcctacccc ncagacacct tggacatcct 4440 
agcccagggt cctgcactta cttgcttatt 4500 
tccgagagtg ggccagtctg tggatcagag 4560 
cggtattcga tactgcgact ttctgcctgg 4 620 
ggaaaggggt cggctgacca tgctcatctg 4680 
gtgggttcat ttgtgtaacg acaataaacg 4740 
ggtggtggtt gagtctcttc ttggcct 4797 



ggagcaggag accaaacgac gggggtcgga 60 
gagcacgagc ctgagcggga gagcgccgct 120 
gcagccagag ccaccagcgc agcgctgcca 180 
gcctcatgct ggctgtggga ggagcagtgc 240 
gagtcatcaa tgccccccag aaggtgatcg 300 
gctatgggga gagcatcctg cccaccacgc 360 
tcttttctgt tgggggcatg attggctcct 420 
gccggcggaa ttcaatgctg atgatgaacc 480 
gcttctcgaa actgggcaag tcctttgaga 540 
tgtactgcgg cctgaccaca ggcttcgtgc 600 
cctttcgtgg ggccctgggc accctgcacc 660 
cccaggtgtt cggcctggac tccatcatgg 720 
gcatcatctt catcccggcc ctgctgcagt 780 
cccgcttcct gctcatcaac cgcaacgagg 840 
tgcgcgggac agctgacgtg acccatgacc 900 
tgatgcggga gaagaaggtc accatcctgg 960 
ccatcctcat cgctgtggtg ctgcagctgt 1020 
tctattactq cacgagcatc ttcgagaagg 1080 
ttggctccgg tatcgtcaac acggccttca 1140 
caggccggcg gaccctgcac ctcataggcc 1200 
tgaccatcgc gctagcactg ctggagcagc 1260 
ccatctttgg ctttgtggcc ttctttgaag 1320 
tggctgaact cttcagccag ggtccacgtc 1380 
actggacctc aaatttcatt gtgggcatgt 1440 
cctacgtctt catcatcttc actgtgctcc 1500 
aagttcctga gactaaaggc cggaccttcg 1560 
gagccagcca aagtgataag acacccgagg 1620 
aagtgtgagt cgccccagat caccagccog 1680 
aggagcacag gcagctggat gagacttcca 1740 
ggggctcctt tctccagcca gcaatgatgt 1800 
aggattttaa caaaagcaag actgttgctc 1860 
aattttttta ttactgattt tgttattttt 1920 
ccaggcttca ccctgaatgg ttccatgcct 1980 
ttgccttctt cacccagcta a,tctgtaggg 2040 
gaactatgaa ctacaaagct tctatcccag 2100 
ctggatctcc ccactctagg ggtcaggctc 2160 
tacccaacca ctcaaattaa tctttcttta 2220 
gggaggagag gggaagggcc agtctgggct 2280 
ccacactatt accatgagaa gagggcctgt 2340 
catggagact cctgccctgt tgtgtataga 2400 
caatattaaa tacagacact aagttatagt 2460 
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atatctggac aagccaactt gtaaatacac cacctcactc ctgttactta cctaaacaga 2520 

tataaatggc tggtttttag aaacatggtt ttgaaatgct tgtggattga gggtaggagg 2580 

;tttggatggg agtgagacag aagtaagtgg ggttgcaacc actgcaacgg cttagacttc 2640 

gactcaggat ccagtccctt acacgtacct ctcatcagtg tcctcttgct caaaaatctg 2700 

tttgatccct gttacccaga gaatatatac attctttatc ttgacattca aggcatttct 2760 

atcacatatt tgatagttgg tgttcaaaaa aacactagtt ttgtgccagc cgtgatgctc 2820 

aggcttgaaa tcgcattatt ttgaatgtga agggaa 2856 



<210> 136 
<211> 356 
<212> DNA 

<213> Homo sapiens 



<400> 136 

ggtggagcca aatgaagaaa atgaagatga aagagacaga cacctcagtt tttctggatc 60 
aggcattgat gatgatgaag attttatctc cagcaccatt tcaaccacac cacgggcttt 120 
tgaccacaca aaacagaacc aggactggac tcagtggaac ccaagccatt caaatccgga 180 
agtgctactt cagacaacca caaggatgac tgatgtagac agaaatggca ccactgctta 240 
tgaaggaaac tggaacccag aagcacaccc tcccctcatt caccatgagc atcatgagga 300 
agaagagacc ccacattcta caagcacaat ccaggcaact cctagtagta caacgg 356 

<210> 137 
<211> 356 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 254, 264, 279, 281, 290, 328, 342 
<223> n = A,T,C or G 

<400> 137 

gcaggtggag aagacatttt attgttcctg gggtctctgg aggcccattg gtggggctgg 60 

gtcactggct gcccccggaa cagggcgctg ctccatggct ctgcttgtgg tagtctgtgg 120 

ctatgtctcc cagcaaggac agaaactcag aaaaatcaat cttcttatcc tcattcttgt 180 

cctttttctc aaagacatcg gcgaggtaat ttgtgccctt 1 tttacctcgg cccgcgacca 240 

cgctaaggcc aaanttccag acanayggcc gggccggtnc nataggggan cccaacttgg 300 

ggacccaaac tctggcgcgg aaacacangg gcataagctt gnttcctgtg gggaaa 356 

<210> 138 
<211> 353 
<212> DNA 

<213> Homo sapiens 
<400> 138 

aggtccagtc ctccacttgg cctgatgaga gtggggagtg gcaagggacg tttctcctgc 60 
aatagacact tagatttctc tcttgtggga agaaaccacc tgtccatcca ctgactcttc 120 
tacattgatg tggaaattgc tgctgctacc accacctcct gaagaggctt ccctgatgcc 180 
aatgccagcc atcttggcat cctggccctc gagcaggctg cggtaagtag cgatctcctg 240 
ctccagccgt gtctttatgt caagcagcat cttgtactcc tggttctgag cctccatctc 300 
gcatcggagc tcactcagac ctcgsccgsg mssmcgctam gccgaattcc ao;c 353 

<210> 139 
<211> 371 
<212> DNA 

<213> Homo sapiens 
<400> 139 
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agcgtggtcg cggccgaggt ccatccgaag caagattgca gatggcagtg tgaagagaga 60 
agacatattc tacacttcaa agctttggtg caattcccat cgaccagagt tggtccgacc 120 
agccttggaa aggtcactga aaaatcttca attggattat gttgacctct accttattca 180 
ttttccagtg tctgtaaagc caggtgagga agtgatccca aaagatgaaa atggaaaaat 240 
actatttgac acagtggatc tctgtgccac gtgggaggcc gtggagaagt gtaaagatgc 300 
aggattggac ctgcccgggc ggccgctcga aagccgaatt ccagcacact ggcggccgtt 360 
actagtggat c 371 

<210> 140 

<211> 370 » 

<212> DNA 

<213> Homo sapiens 

<400> 140 

tagcgtggtc gcggccgagg tccatctccc tttgggaact agggggctgc tggtgggaaa 60 
tgggagccag ggcagatgtt gcattccttt gtgtccctgt aaatgtggga ctacaagaag 120 
aggagctgcc tgagtggtac tttctcttcc tggtaatcct ctggcccagc ctcatggcag 180 
aatagaggta tttttaggct atttttgtaa tatggcttct ggtcaaaatc cctgtgtagc 240 
tgaattccca agccctgcat tgtacagccc cccactcccc tcaccaccta ataaaggaat 300 
agttaacact caaaaaaaaa aaaaaacctg cccgggcggc cgctcgaaag ccgaattcca 360 
gcacactggc 370 

<210> 141 
<211> 371 
<212> DNA 
<213> Homo sapiens 

<400> 141 

tagcgtggtc gcggccgagg tcctctgtgc tgcctgtcac agcccgatgg taccagcgca 60 
gggtgtaggc agtgcaggag ccctcatcca gtggcaggga acaggggtca tcactatccc 120 
aaggagcttc agggtcctgg tactcctcca cagaatactc ggagtattca gagtactcat 180 
catcctcagg cfggtacccgc tcttcctcct ctgcatgaga gacgcggagc acaggcacag 240 
catggagctg ggagccggca gtgtctgcag cataactagg gaggggtcgt gatccagatg 300 
cgatgaactg gccctggcag gcacagtgct gactcatctc ttggcgacct gcccgggcgg 360 
ccgctcgaag c 371 

<210> 142 
<211> 343 
<212> DNA 
<213> Homo sapiens 



<400> 142 

gcgttttgag gccaatggtg taaaaggaaa 
tgtcagaaac ctctttgtga tgtttgcttt 
agagcagttt tgaaacactc ttttgtagaa 
gtcttcattg gaaacgggat acctttacat 
tctttgggat gtgggcattc aacccacaga 
aacacccttt ttgtagaatc tacaggtgga 

<210> 14 3 
<211> 354 
<212> DNA 
<213> Homo sapiens 



tatcttcaca taaaaactag atggaagcat 60 
caactcacag agttgaacat tccttttcat 120 
tttgcaagcg gatgattgga tcgctatgag 180 
aaaaactaga cagtagcatt ctcagaaatt 240 
ggagaacttc atttgataga gcagttttga 300 
catttagagt get 343 



<400> 143 

aggtctgatg gcagaaaaac tcagactgtc 
catcaggagt gggatgggaa ggaaagcaca 
gtggtggagt gtgtcatgaa caatgtcacc 



tgcaacttta cagatggtgc attggttcag 60 
ataacaagaa aattgaaaga tgggaaatta 120 
tgtactcgga tctatgaaaa agtagaataa 180 
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aaattccatc atcactttgg acaggagtta attaagagaa tgaccaagct cagttcaatg 240 
agcaaatctc catactgttt ctttcttttt tttttcatta ctgtgttcaa ttatctttat 300 
^cataaacatt ttacatgcag ctatttcaaa gtgtgttgga ttaattagga teat 354 



<210> 144 
<211> 353 
<212> DNA 
<213> Homo sapiens 

<400> 144 

ggtcaaggac ctgggggacc cccaggtcca gcagccacat gattctgeag cagacaggga 60 

cctagagcac atctggatct cagccccacc cctggcaacc tgcctgccta gagaactccc 120 

aagatgacag actaagtagg attctgecat ttagaataat tctggtatcc tgggcgttgc 180 

gttaagttgc ttaactttca ttctgtctta cgatagtctt cagaggtggg aacagatgaa 240 

gaaaccatgc cccagagaag gttaagtgac ttcctcttta tggagccagt gttccaacct 300 

aggtttgect gataccagac ctgtggcccc acctcccatg caggtctctg tgg 353 

<210> 145 
<211> 371 
<212> DNA 
<213> Homo sapiens 

<400> 145 

caggtctgtc ataaactggt ctggagtttc tgacgactcc ttgttcacca aatgeaccat 60 
ttcctgagac ttgctggcct etcegttgag tccacttggc tttctgtcct ccacagctcc 120 
attgecactg ttgatcacta gctttttctt ctgcccacac cttcttcgac tgttgactgc 180 
aatgeaaact gcaagaatca aagecaagge caagagggat gecaagatga tcagccattc 240 
tggaatttgg ggtgtcctta taggaccaga ggttgtgttt gctccacctt cttgactccc 300 
atgtgagacc tcggccgcga ccacgctaag ccgaattcca gcacactggc ggcccqttac 360 
tagtggatcc g * 371 

<210> 146 
<211> 355 
<212> DNA 
<213> Homo sapiens 

<400> 146 

ggtcctccgt cctcttccca gaggtgtcgg ggcttggccc cagcctccat cttcgtctct 60 
caggatggcg agtagcagcg gctccaaggc tgaattcatt gtcggaggga aatataaact 120 
ggtacggaag ategggtctg gctccttcgg ggacatctat ttggcgatca acatcaccaa 180 
eggegaggaa gtggcagtga agctagaatc teagaaggee aggcatcccc agttgctgta 240 
cgagagcaag ctctataaga ttcttcaagg tggggttggc atcccccaca tacggtggta 300 
tggtcaggaa aaagactaca atgtactagt catggatctt ctgggaccta geetc 355 

<210> 147 

<211> 355 

<212> DNA 

<213> Homo sapiens 

<400> 147 

ggtctgttac aaaatgaaga cagacaacac aacatttact ctgtggagat atcctactca 60 
tactatgeae gtgctgtgat tttgaacata actcgtccca aaaacttgtc acgatcatcc 120 
tgacttttta ggttggctga tccatcaatc ttgcactcaa ctgttacttc tttcccagtg 180 
ttgttaggag caaagctgac ctgaacagca accaatggct gtagataccc aacatgeagt 240 
tttttcccat aatatgggaa atattttaag tctatcattc cattatgagg ataaactget 300 
acatttggta tatcttcatt ctttgaaaca caatctatcc ttggcactcc ttcag 355 

<210> 148 
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<211> 369 
<212> DNA 
x213> Homo sapiens 



<400> 148 

aggtctctct 

caccttcctt 

agggagtgtg 

atgtggcagc 

gctgcagcag 

gaaaagatga 

acttcttca 



ccccctctcc 
catgatgtgg 
ccgagggctt 
ccctcttctt 
cctccatcca 
gagaagttac 



ctctcctgcc 
gaagagtgct 
ctgagaaggt 
caagtggctc 
gcctgaggat 
agactctcct 



agccaagtga 
gcaacccagc 
ttctctcaca 
ttgtcctgtt 
gacatcaata 
gggcgacccc 



agacatgctt 
cctagccaac 
tctagaaaga 
gccctgggag 
cacagaggaa 
gagagcttac 



acttcccctt 60 
accgcatgag 120 
agcgcttaag 180 
ttctcaaatt 240 
gaagagtcag 300 
cattcctcag 360 
369 



<210> 149 
<211> 620 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 

<222> 169, 171, 222, 472, 528, 559, 599 
<223> n = A,T,C or G 



<400> 149 

actagtcaaa aatgctaaaa taatttggga 
catgtttatc ttttattatg ttttgtgaag 
gccaat'attt ccttatatct atccataaca 
gtgaaactta acactttata aggtaaaaat 
gttcttgtta tttccaaata gaatggactt 
ataaggttaa aagttgttaa tgaccaaaca 
tttcaagcct tcgaactatt taaggaaagc 
gagaatttct cattaatatc ctgaatcatt 
atgtctctaa gaaagtacta tttcatggtc 
tttcccttaa gtgtgaaant atttaaaatg 
agggttaagg gtgttgggga 

<210> 150 
<211> 371 
<212> DNA 
<213> Homo sapiens 



gaaaatattt tttaagtagt gttatagttt 60 
ttgtgtcttt tcacfeaatta cctatactat 120 
tttatactac atttgtaana naatatgcac 180 
gaggtttcca anatttaata atctgatcaa 240 
ggtctgttaa gggctaagga gaagaggaag 300 
ttctaaaaga aatgcaaaaa aaaagtttat 360 
aaaatcattt cctaaatgca tatcatttgt 420 
catttcacta aggctcatgt tnactccgat 480 
caaacctggt tgccatantt gggtaaaggc 540 
aaattttcct ctttttaaaa attctttana 600 

620 



<400> 150 

ggtccgatca aaacctgcta cctccccaag 

gagcaaccag tatcacttcc ctgtttataa 

atgctgaaaa ccacctggtc tgcatgtatg 

aaaatttaat tttagggatt catttctata 

atatgtgtaa ggtgaaattt atggtatttg 

tcatttttcc cccagtgaat gatttagaat 
ttacttttat a 



actttactag tgccgataaa ctttctcaaa 60 

aacctctaac catctctttg ttctttgaac 120 

cccgaatttg yaattctttt ctctcaaatg 180 

ttttcacata tgtagtatta ttatttcctt 240 

agtgtgcaag aaaatatatt tttaaagctt 300 

tttttatgta aatatacaga atgttttttc 360 

371 



<210> 151 f 

<211> 4655 

<212> DNA 

<213> Homo sapiens 

<400> 151 

gggacttgag ttctgttatc ttcttaagta gattcatatt gtaagggtct cggggtgggg 60 
gggttggcaa aatcctggag ccagaagaaa ggacagcagc attgatcaat cttacagcta 120 



WO 02/00174 



PCT/US01/21065 



69 



acatgttgta cctggaaaac aatgcccaga ctcaatttag tgagccacag tacacgaacc 180 
tggggctcct gaacagcatg gaccagcaga ttcagaacgg ctcctcgtcc accagtccct 240 
^ataacacaga ccacgcgcag aacagcgtca cggcgccctc gccctacgca cagcccagct 300 
ccaccttcga tgctctctct ccatcacccg ccatcccctc caacaccgac tacccaggcc 360 
cgcacagttt cgacgtgtcc ttccagcagt cgagcaccgc caagtcggcc acctggacgt 420 
attccactga actgaagaaa ctctactgcc aaattgcaaa gacatgcccc atccagatca 480 
aggtgatgac cccacctcct cagggagctg ttatccgcgc catgcctgtc tacaaaaaag 540 
ctgagcacgt cacggaggtg gtgaagcggt gccccaacca tgagctgagc cgtgaattca 600 
acgagggaca gattgcccct yctagtcatt tgattcgagt agaggggaac agccatgccc 660 
, agtatgtaga agatcccatc acaggaagac agagtgtgct ggtaccttat gagccacccc 720 
aggttggcac tgaattcacg acagtcttgt acaatttcat gtgtaacagc agttgtgttg 780 
gagggatgaa ccgccgtcca attttaatca ttgttactct ggaaaccaga gatgggcaag 840 
tcctgggccg acgctgcttt gaggcccgga tctgtgcttg cccaggaaga gacaggaagg 900 
cggatgaaga tagcatcaga aagcagcaag tttcggacag tacaaagaac ggtgatggta 960 
cgaagcgccc gtttcgtcag aacacacatg gtatccagat gacatccatc aagaaacgaa 1020 
gatccccaga tgatgaactg gtatacttac cagtgagggg ccgtgagact tatgaaatgc 1080 
tggtgaagat caaagagtcc ctggaactca tgcagtacct tcttcagcac acaattgaaa 1140 
cgtacaggca acagcaacag cagcagcacc agcacttact tcagaaacag acctcaatac 1200 
agtctccatc ttcatatggt aacagctccc cacctctgaa caaaatgaac agcatgaaca 1260 
agctgccttc tgtgagccag cttatcaacc ctcagcagcg caacgccctc actcctacaa 1320 
ccattcctga tggcatggga gccaacattc ccatgatggg cacccacatg ccaatggctg 1380 
gagacatgaa tggactcagc cccacccagg cactccctcc cccactctcc atgccatcca 1440 
cctcccactg cacaccccca cctccgtatc ccacagattg cagcattgtc agtttcttag 1500 
cgaggttggg ctgttcatca tgtctggact atttcacgac ccaggggctg accaccatct 1560 
atcagattga gcattactcc atggatgatc tggcaagtct gaaaatccct gagcaatttc 1620 
gacatgcgat ctggaagggc atcctggacc accggcagct ccacgaattc tcctcccctt 1680 
ctcatctcct gcggacccca agcagtgcct ctacagtcag tgtgggctcc agtgagaccc 1740 
ggggtgagcg tgttattgat gctgtgcgat tcaccctccg ccagaccatc tctttcccac 1800 
cccgagatga gtggaatgac ttcaactttg acatggatgc tcgccgcaat aagcaacagc 1860 
gcatcaaaga ggagggggag tgagcctcac catgtgagct cttcctatcc ctctcctaac 1920 
tgccagcccc ctaaaagcac tcctgcttaa tcttcaaagc cttctcccta gctcctcccc 1980 
ttcctcttgt ctgatttctt aggggaagga gaagtaagag gcttacttct taccctaacc 2040 
atctgacctg gcatctaatt ctgattctgg ctttaagcct tcaaaactat agcttgcaga 2100 
actgtagctt gccatggcta ggtagaagtg agcaaaaaag agttgggtgt ctccttaagc 2160 
tgcagagatt tctcattgac ttttataaag catgttcacc cttatagtct aagactatat 2220 
atataaatgt ataaatatac agtatagatt tttgggtggg gggcattgag tattgtttaa 2280 
aatgtaattt aaatgaaaga aaattgagtt gcacttattg accatttttt aatttacttg 2340 
ttttggatgg cttgtctata ctccttccct taaggggtat catgtatggt gataggtatc 2400 
tagagcttaa tgctacatgt gagtgacgat gatgtacaga ttctttcagt tctttggatt 2460 
ctaaatacat gccacatcaa acctttgagt ! agatccattt ccattgctta ttatgtaggt 2520 
aagactgtag atatgtattc ttttctcagt gttggtatat tttatattac tgacatttct 2580 
tctagtgatg atggttcacg ttggggtgat ttaatccagt tataagaaga agttcatgtc 2640 
caaacgtcct ctttagtttt tggttgggaa tgaggaaaat tcttaaaagg cccatagcag 2700 
ccagttcaaa aacacccgac gtcatgtatt tgagcatatc agtaaccccc ttaaatttaa 2760 
taccagatac cttatcttac aatattgatt gggaaaacat ttgctgccat tacagaggta 2820 
ttaaaactaa atttcactac tagattgact aactcaaata cacatttgct actgttgtaa 2880 
gaattctgat tgatttgatt gggatgaatg ccatctatct agttctaaca gtgaagtttt 2940 
actgtctatt aatattcagg gtaaatagga atcattcaga aatgttgagt ctgtactaaa 3000 
cagtaagata tctcaatgaa ccataaattc aactttgtaa aaatcttttg aagcatagat 3060 
aatattgttt ggtaaatgtt tcttttgttt ggtaaatgtt tcytttaaag accctcctat 3120 
tctataaaac tctgcatgta gaggcttgtt tacctttctc tctctaaggt ttacaatagg 3180 
agtggtgatt tgaaaaatat aaaattatga gattggtttt cctgtggcat aaattgcatc 3240 
actgtatcat tttctttttt aaccggtaag agtttcagtt tgttggaaag taactgtgag 3300 
aacccagttt cccgtccatc tcccttaggg actacccata gacatgaaag gtccccacag 3360 
agcaagagat aagtctttca tggctgctgt tgcttaaacc acttaaacga agagttccct 3420 
tgaaactttg ggaaaacatg ttaatgacaa tattccagat ctttcagaaa tataacacat 3480 
ttttttgcat gcatgcaaat gagctctgaa atcttcccat gcattctggt caagggctgt 3540 
cattgcacat aagcttccat tttaatttta aagtgcaaaa gggccagcgt ggctctaaaa 3600 
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ggtaatgtgt ggattgcctc tgaaaagtgt gtatatattt tgtgtgaaat tgcatacttt 3660 
gtattttgat tatttttttt ttcttcttgg gatagtggga tttccagaac cacacttgaa 3720 
accttttttt atcgtttttg tattttcatg aaaataccat ttagtaagaa taccacatca 3780 
aataagaaat aatgctacaa ttttaagagg ggagggaagg gaaagttttt ttttttatta 3840 
tttttttaaa attttgtatg ttaaagagaa tgagtccttg atttcaaagt tttgttgtac 3900 
ttaaatggta ataagcactg taaacttctg caacaagcat gcagctttgc aaacccatta 3960 
aggggaagaa tgaaagctgt tccttggtcc tagtaagaag acaaactgct tcccttactt 4020 
tgctgagggt ttgaataaac ctaggacttc cgagctatgt cagtactatt caggtaacac 4080 
tagggccttg gaaatccctg tactgtgtct catggatttg gcactagcca aagcgaggca 4140 
ccccttactg gcttacctcc tcatggcagc ctactctcct tgagtgtatg agtagccagg 4200 
gtaaggggta aaaggatagt aagcatagaa accactagaa agtgggctta atggagttct 4260 
tgtggcctca gctcaatgca gttagctgaa gaattgaaaa gtttttgttt ggagacgttt 4320 
ataaacagaa atggaaagca gagttttcat taaatccttt tacctttttt ttttcttggt 4380 
aatcccctaa aataacagta tgtgggatat tgaatgttaa agggatattt ttttctatta 4440 
tttttataat tgtacaaaat taagcaaatg ttaaaagttt tatatgcttt attaatgttt 4500 
tcaaaaggta ttatacatgt gatacatttt ttaagcttca gttgcttgtc ttctggtact 4560 
ttctgttatg ggcttttggg gagccagaag ccaatctaca atctcttttt bftttgccagg 4620 
acatgcaata aaatttaaaa aataaataaa aacta • 4655 

<210> 152 
<211> 586 
<212> PRT 

<213> Homo sapiens 



<400> 152 



Met 


Leu 


Tyr 


Leu 


Glu 


Asn 


Asn 


Ala 


Gin 


Thr Gin 


Phe Ser Glu Pro Gin 


1 






5 










10 


15 


Tyr 


Thr 


Asn 


Leu 


Gly 


Leu 


Leu 


Asn 


Ser 


Met Asp 


Gin Gin He Gin Asn 






20 










25 




30 


Gly 


Ser 


Ser 


Ser 


Thr 


Ser 


Pro 


Tyr 


Asn 


Thr Asp 


His Ala Gin Asn Ser 






35 










40 






45 


Val 


Thr 


Ala 


Pro 


Ser 


Pro 


Tyr 


Ala 


Gin 


Pro Ser 


Ser Thr Phe Asp Ala 




50 










55 








60 


Leu 


Ser 


Pro 


Ser 


Pro 


Ala 


He 


Pro 


Ser 


Asn Thr 


Asp Tyr Pro Gly Pro 


65 










70 








75 


80 


His 


Ser 


Phe 


Asp 


Val 


Ser 


Phe 


Gin 


Gin 


Ser Ser 


Thr Ala Lys Ser Ala 








85 










90 


95 


Thr 


Trp 


Thr 


Tyr 


Ser 


Thr 


Glu 


Leu 


Lys 


Lys Leu 


Tyr Cys Gin He Ala 








100 










105 




110 


Lys 


Thr 


Cys 


Pro 


He 


Gin 


He 


Lys 


Val 


Met Thr 


Pro Pro Pro Gin Gly 






115 










120 






125 


Ala 


Val 


lie 


Arg 


Ala 


Met 


Pro 


Val 


Tyr 


Lys Lys 


Ala Glu His Val Thr 




130 










135 








140 


Glu 


Val 


Val 


Lys 


Arg 


Cys 


Pro 


Asn 


His 


Glu Leu 


Ser Arg Glu Phe Asn 


145 










150 








155 


160 


Glu 


Gly 


Gin 


He 


Ala 


Pro 


Ser 


Ser 


His 


Leu He 


Arg Val Glu Gly Asn 








165 










170 


175 


Ser 


His 


Ala 


Gin 


Tyr 


Val 


Glu 


Asp 


Pro 


He Thr 


Gly Arg Gin Ser Val 








180 










185 




190 


Leu 


Val 


Pro 


Tyr 


Glu 


Pro 


Pro 


Gin 


Val 


Gly Thr 


Glu Phe Thr Thr Val 






195 










200 






205 


Leu 


Tyr 


Asn 


Phe 


Met 


Cys 


Asn 


Ser 


Ser 


Cys Val 


Gly Gly Met 1 Asn Arg 




210 










215 








220 


Arg 


Pro 


He 


Leu 


He 


He 


Val 


Thr 


Leu 


Glu Thr 


Arg Asp Gly Gin Val 


225 










230 








235 


240 


Leu 


Gly 


Arg 


Arg 


Cys 


Phe 


Glu 


Ala 


Arg 


He Cys 


Ala Cys Pro Gly Arg 










245 










250 


255 


Asp 


Arg 


Lys 


Ala 


Asp 


Glu 


Asp 


Ser 


He 


Arg Lys 


Gin Gin Val Ser Asp 
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260 265 270 

Ser Thr Lys Asn Gly Asp Gly Thr Lys Arg Pro Phe Arg Gin Asn Thr 
* 275 280 285 

His Gly He Gin Met Thr Ser He Lys Lys Arg Arg Ser Pro Asp Asp 

290 295 300 

Glu Leu Val Tyr Leu Pro Val Arg Gly Arg Glu Thr Tyr Glu Met Leu 
305 310 315 320 

Val Lys He Lys Glu Ser Leu Glu Leu Met Gin Tyr Leu Leu Gin His 

325 330 335 

Thr He Glu Thr Tyr Arg Gin Gin Gin Gin Gin Gin His Gin His Leu 

340. 345 350 

Leu Gin Lys Gin Thr Ser He Gin Ser Pro Ser Ser Tyr Gly Asn Ser 

355 360 365 

Ser Pro Pro Leu Asn Lys Met Asn Ser Met Asn Lys Leu Pro Ser Val 

370 375 380 

Ser Gin Leu He Asn Pro Gin Gin Arg Asn Ala Leu Thr Pro Thr Thr. 
385 390 395 400 

He Pro Asp Gly Met Gly Ala Asn He Pro Met Met Gly Thr His Met 

405 410 415 

Pro Met Ala Gly Asp Met Asn Gly Leu Ser Pro Thr Gin Ala Leu Pro 

420 425 430 

Pro Pro Leu Ser Met Pro Ser Thr Ser His Cys Thr Pro Pro Pro Pro 

435 440 445 

Tyr Pro Thr Asp Cys Ser He Val Ser Phe Leu Ala Arg Leu Gly Cys 

450 455 460 

Ser Ser Cys Leu Asp Tyr Phe Thr Thr Gin Gly Leu Thr Thr He Tyr 
465 470 475 480 

Gin He Glu His Tyr Ser Met Asp Asp Leu Ala Ser Leu Lys He Pro 

485 490 495 

Glu Gin Phe Arg His Ala He Trp Lys Gly He Leu Asp His Arg Gin 

500 505 510 

Leu His Glu Phe Ser Ser Pro Ser His Leu Leu Arg Thr Pro Ser Ser 

515 520 525 

Ala Ser Thr Val Ser Val Gly Ser Ser Glu Thr Arg Gly Glu Arg Val 

530 535 , 540 

He Asp Ala Val Arg Phe Thr Leu Arg Gin Thr He Ser Phe Pro Pro 
545 550 555 560 

Arg Asp Glu Trp Asn Asp Phe Asn Phe Asp Met Asp Ala Arg Arg Asn 

565 570 575 

Lys Gin Gin Arg He Lys Glu Glu Gly Glu 
580 585 



<210> 153 

<211> 2007 

<212> DNA 

<213> Homo sapiens 

<400> 153 

gaattcgtcg ctgctccagg gaaagttctg 
acatggccag caagaaagta attacagtgt 
tggccagggc aattttggag agcaaaaaat 
cttgaccaaa tgccctggag ctccagcgcc 
atgataaagc atcggtggac agtgccttaa 
acttctggga ccctctcaac caagataagg 
ccgccaagca cctgggtctg aagcacgtgg 
tgacggatgg caagctggag gtgccgcact 
tctggtccat tggcatcccc atgaccagtg 



ttactccact gactctctct tttcctgata 60 
ttggagcaac aggagctcaa ggtggctctg 120 
ttgcagtgag agcagtgacc agggatgtga 180 
ttggagctga ggtggtcaaa ggtgacctga 240 
aaggtgtcta tggggccttc ttggtgacca 300 
aagtgtgtcg ggggaagctg gtggcagact 360 
tgtacagcgg cctggagaac gtcaagcgac 420 
ttgacagcaa gggcgaggtg gaggagtact 480 
tccgcgtggc ggcctacttt gaaaactttc 540 
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tcgcggcgtg gcggcccgtg aaagcctctg atggagatta ctacaccttg gctgtaccga 600 
tgggagatgt accaatggat ggtatctctg ttgctgatat tggagcagcc gtctctagca 660 
^tttttaattc tccagaggaa tttttaggca aggccgtggg gctcagtgca gaagcactaa 720 
caatacagca atatgctgat gttttgtcca aggctttggg gaaagaagtc cgagatgcaa 780 
agattacccc ggaagctttc gagaagctgg gattccctgc agcaaaggaa atagccaata 840 
tgtgtcgttt ctatgaaatg aagccagacc gagatgtcaa tctcacccac caactaaatc 900 
ccaaagtcaa aagcttcagc cagtttatct cagagaacca gggagccttc aagggcatgt 960 
agaaaatcag ctgttcagat aggcctctgc accacacagc ctctttcctc tctgatcctt 1020 
ttcctcttta cggcacaaca ttcatgttga cagaacatgc tggaatgcaa ttgtttgcaa 1080 
1 caccgaagga tttcctgcgg tcgcctcttc agtaggaagc actgcattgg tgataggaca 1140 
cggtaatttg attcacattt aacttgctag ttagtgataa gggtggtaca actgtttggt 1200 
aaaatgagaa gcctcggaac ttggagcttc tctcctacca ctaatgggag ggcagattat 1260 
actgggattt ctcctgggtg agtaatttca agccctaatg ctgaaattcc cctaggcagc 1320 
tccagttttc tcaactgcat tgcaaaattc ccagtgaact tttaagtact tttaacttaa 1380 
aaaaatgaac atctttgtag agaattttct ggggaacatg gtgttcaatg aacaagcaca 1440 
agcattggaa atgctaaaat tcagttttgc ctcaagattg gaagtttatt ttctgactca 1500 
ttcatgaagt catctattga gccaccattc aattattcat ctattaattc cttgatcctt 1560 
catttatcca ttctgcaaac ttttcttgag caccagcacg ggtggccatt tgtggacttc 1620 
tcttcattcc tatgtgtttt cttatcaaag tgatccactc tcgaaaggct cctttccagt 1680 
ctgtggttgg gttcaagtca tgccagggcc agggggccca tctcctcgtt tagctctagg 1740 
caaaatccag gggatctgca gtggggagcg ggggcaggaa gctggaggga aggcctgtga 1800 
agggtaggga tgtggaaaga caaggtgaca gaaggaccca ataggacctt tctatatctc 1860 
tggcttagca ttttctacat catattgtaa tcgtcttatt tgctagtttt cttccttact 1920 
gtgagtgact aacagtcatc tttatcccag tgcctggtac ataataagtg atcaataaat 1980 
gttgattgac taaaaaaaaa aaaaaaa 2007 

<210> 154 

<211> 2148 

<212> DNA 

<213> Homo sapiens 

<400> 154 

gaattcgtcg ctgctccagg gaaagttctg ttactccact gactctctct tttcctgata 60 
acatggccag caagaaagta attacagtgt ttggagcaac aggagctcaa ggtggctctg 120 
tggccagggc aattttggag agcaaaaaat ttgcagtgaq agcagtgacc agggatgtga 180 
cttgaccaaa tgccctggag ctccagcgcc ttggagctga ggtggtcaaa ggtgacctga 240 
atgataaagc atcggtggac agtgccttaa aaggggaagc tggtggcaga ctccgccaag 300 
cacctgggtc tgaagcacgt ggtgtacagc ggcctggaga acgtcaagcg actgacggat 360 
ggcaagctgg aggtgccgca ctttgacagc aagggcgagg tggaggagta cttctggtcc 420 
attggcatcc ccatgaccag tgtccgcgtg gcggcctact ttgaaaactt tctcgcggcg 480 
tggcggcccg tgaaagcctc tgatggagat tactacacct tggctgtacc gatgggagat 540 
gtaccaatgg atggtatctc tgttgctgat attggagcag ccgtctctag catttttaat 600 
tctccagagg aatttttagg caaggccgtg gggctcagtg cagaagcact aacaatacag 660 
caatatgctg atgttttgtc caaggctttg gggaaagaag tccgagatgc aaagactatc 720 
tgtgctatag atgaccagaa aacagtggaa gaaggtttca tggaagacgt gggcttgagt 780 
tggtccttga gggaacatga ccatgtatag acagaggagg catcaagaag gctggcctgg 840 
ctaattctgg aataaacacg acaaaccaga ggcagtacgg gaaggaggca aattctggct 900 
ctgcctctat ccttgattac cccggaagct ttcgagaagc tgggattccc tgcagcaaag 960 
gaaatagcca atatgtgtcg tttctatgaa atgaagccag accgagatgt caatctcacc 1020 
caccaactaa atcccaaagt caaaagcttc agccatttta tctcagagaa ccagggagcc 1080 
ttcaagggca tgtagaaaat cagctgttca gataggcctc tgcaccacac agcctctttc 1140 
ctctctgatc cttttcctct ttacggcaca acattcatgt tgacagaaca tgctggaatg 1200 
caattgtttg caacaccgaa ggatttcdtg cggtcgcctc ttcagtagga agcactgcat 1260 
tggtgatagg acacggtaat ttgattcaca tttaacttgc tagttagtga taagggtggt 1320 
acaactgttt ggtaaaatga gaagcctcgg aacttggagc ttctctccta ccactaatgg 1380 
gagggcagat tatactggga tttctcctgg gtgagtaatt tcaagcccta atgctgaaat 1440 
tcccctaggc agctccagtt ttctcaactg cattgcaaaa ttcccagtga acttttaagt 1500 
acttttaact taaaaaaatg aacatctttg tagagaattt tctggggaac atggtgttca 1560 
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atgaacaagc acaagcattg 
attttctgac tcattcatga 
; ttccttgatc cttcatttat 
atttgtggac ttctcttcat 
gctcctttcc agtctgtggt 
gtttagctct aggcaaaatc 
ggaaggcctg tgaagggtag 
ctttctatat ctctggctta 
tttcttcctt actgtgagtg 
z gtgatcaata aatgttgatt 

<210> 155 
<211> 153 
<212> PRT 

<213> Homo sapiens 
<400> 155 

Met Thr Ser Val Arg Val Ala Ala Tyr Phe Glu Asn Phe Leu Ala Ala 

1 5 10 15 

Trp Arg Pro Val Lys Ala Ser Asp Gly Asp Tyr Tyr Thr Leu Ala Val 

20 25 30 

Pro Met Gly Asp Val Pro Met Asp Gly He Ser Val Ala Asp He Glv 

35 40 45 

Ala Ala Val Ser Ser He Phe Asn Ser Pro Glu Glu Phe Leu Glv Lvs 

50 55 60 

Ala Val Gly Leu Ser Ala Glu Ala Leu Thr He Gin Gin Tyr Ala Asp 
65 70 75 80 

Val Leu Ser Lys Ala Leu Gly Lys Glu Val Arg Asp Ala Lys He Thr 

85 90 95 

Pro Glu Ala Phe Glu Lys Leu Gly Phe Pro Ala Ala Lys Glu He Ala 

100 105 no 

Asn Met Cys Arg Phe Tyr Glu Met Lys Pro Asp Arg Asp Val Asn Leu 

H5 120 125 

Thr His Gin Leu Asn Pro Lys Val Lys Ser Phe Ser Gin Phe He Ser 

130 135 140 

Glu Asn Gin Gly Ala Phe Lys Gly Met 
145 150 



gaaatgctaa aattcagttt tgcctcaaga ttggaagttt 1620 
agtcatctat tgagccacca ttcaattatt catctattaa 1680 
ccattctgca aacttttctt gagcaccagc acgggtggcc 1740 
tcctatgtgt tttcttatca aagtgatcca ctctcgaaag 1800 
tgggttcaag tcatgccagg gccagggggc ccatctcctc 1860 
caggggatct gcagtgggga gcgggggcag gaagctggag 1920 
ggatgtggaa agacaaggtg acagaaggac ccaataggac 1980 
gcattttcta catcatattg taatcgtctt atttgctagt 2040 
actaacagtc atctttatcc cagtgcctgg tacataataa 2100 
gactaaatga aaaaaaaaaa aaaaaaaa 214 8 



<210> 156 
<211> 128 
<212> PRT 

<213> Homo sapiens 
<400> 156 

Met Thr Ser Val Arg Val Ala Ala Tyr Phe Glu Asn Phe Leu Ala Ala 

15 io 15 

Trp Arg Pro Val Lys Ala Ser Asp Gly Asp Tyr Tyr Thr Leu Ala Val 

20 25 30 

Pro Met Gly Asp Val Pro Met Asp Gly He Ser Val Ala Asp He Gly 

35 40 45 

Ala Ala Val Ser Ser lie Phe Asn Ser Pro Glu Glu Phe Leu Glv Lvs 

50 55 60 

Ala Val Gly Leu Ser Ala Glu Ala Leu Thr He Gin Gin Tyr Ala Asp 
65 70 75 80 

Val Leu Ser Lys Ala Leu Gly Lys Glu Val Arg Asp Ala Lys Thr He 

85 90 95 

Cys Ala He Asp Asp Gin Lys Thr Val Glu Glu Gly Phe Met Glu Asp 
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/ 

100 105 110 

Val Gly Leu Ser Trp Ser Leu Arg Glu His Asp His Val Ala Gly Ala 
* 115 120 125 



<210> 157 
<211> 424 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 320, 322 
<223> n = A, T, C or G 



<400> 157 

ctgcagcccg ggggatccac tagtccagtg 
ggatacatta cagcagacat ggaaatataa 
aattcagtca ccactgttat attaccttct 
tattagattt ccttgtatgc aaagtttttg 
agaggaagga gaaaactgca tcataacttt 
agcccagaaa cttctctgcn gnatctggct 
ccccagccat cgagtcagtt tgtgcccatg 
tgct 

<210> 158 
<211> 2099 
<212> DNA 

<213> Homo sapiens 



tggtggaatt cattggtctt tacaagactt 60 
ttttaaaaaa tttctctcca acctccttca 120 
ccaggaaccc tccagtgggg aaggctgcga 180 
ttgaaagctg tgctcagagg aggtgagagg 240 
acagaattga atctagagtc ttccccgaaa 300 
tgtccatctg gtctaaggtg gctgcttctt 360 
aataatacac gacctgctat ttcccatgac 420 

424 



<400> 158 

ccgcggttaa aaggcgcagc aggtgggagc 
ccgacagccg gcggcgcccg agcccgacct 
ccgcgcagag cccgcgccag ggccgccggc 
aaggcacttc ctgtcggtga agaagacctg 
caaacggggc tgacctccct tcctggggag 
agaagatctg gctaaacaat ttctgtatgg 
ttcatgcatc tttaattcaa tttgaatatt 
attgacattc gtatcatcac tgtgcaccat 
aaggaggtct gaaaccctcg cagagggatc 
gcagtcgttg gaaacaggac tcagggataa 
actttcatcg ggggtgtcaa caaacactcc 
atctttattt tccgagtcat gatcctcgtg 
caagaggact tcgtctgcaa cacactgcaa 
tttttcccgg tgtcccacat ccggctgtgg 
gcgctgctgg tggccatgca tgtggcctac 
cgaggagaga agaggaatga tttcaaagac 
atagaggggt cgctgtggtg gacgtacacc 
gcagccttta tgtatgtgtt ttacttcctt 
aaatgtggga ttgacccctg ccccaacctt 
aagaccgtgt ttaccatttt tatgatttct 
gcagagttgt gctacctgct gctgaaagtg 
caaaaaaatc accccaatca tgccctaaag 
atttcagata gtggtcaaaa tgcaatcaca 
tgtagctgcg tcataaggag acttctgtct 
ccttctgtag cctgaagagt ttgtaaatga 
tttttgtagg atacttgctc cattcataca 
aaaacaagag actgcttgac aaaggagcat 



cggggccttc acccgaaacc cgacgagagc 60 
gcctgcccag ccggagcgaa gggcgccgcc 120 
cgcagagcag ttaaaacgtg caggcaccag 180 
tctccggtg^ cacgggcatc ctgtgttttg 240 
caggaagggt cagggaagga aaagaagtac 300 
cgaaagaaaa attctaactt gtacgccctc 360 
ccaggcgaca tcctcactga ccgagcaaag 420 
tggcttctag gcactccagt ggggtaggag 480 
ttgccctcat tctttgggtc tgaaacactg 540 
accagcgcaa tggattgggg gacgctgcac 600 
accagcatcg ggaaggtgtg gatcacagtc 660 
gtggctgccc aggaagtgtg gggtgacgag 720 
ccgggatgca aaaatgtgtg ctatgaccac 780 
gccctccagc tgatcttcgt ctccacccca 840 
tacaggcacg aaaccactcg caagttcagg 900 
atagaggaca ttaaaaagca gaaggttcgg 960 
agcagcatct ttttccgaat catctttgaa 1020 
tacaatgggt accacctgcc ctgggtgttg 1080 
gttgactgct ttatttctag gccaacagag 1140 
gcgtctgtga tttgcatgct ^cttaacgtg 1200 
tgttttagga gatcaaagag agcacagacg 1260 
gagagtaagc agaatgaaat gaatgagctg 1320 
ggttcccaag ctaaacattt caaggtaaaa 1380 
tctccagaag gcaataccaa cctgaaagtt 144 0 
ctttcataat aaatagacac ttgagttaac 1500 
caacgtaatc aaatatgtgg tccatctctg 1560 
tgcagtcact ttgacaggtt ccttttaagt 1620 
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ggactctctg acaaagtggg tactttctga aaatttatat aactgttgtt gataaggaac 1680 
atttatccag gaattgatac gtttattagg aaaagatatt tttataggct tggatgtttt 1740 
%agttctgac tttgaattta tataaagtat ttttataatg actggtcttc cttacctgga 1800 
aaaacatgcg atgttagttt tagaattaca ccacaagtat ctaaatttgg aacttacaaa 1860 
gggtctatct tgtaaatatt gttttgcatt gtctgttggc aaatttgtga actgtcatga 1920 
tacgcttaag gtggaaagtg ttcattgcac aatatatttt tactgctttc tgaatgtaga 1980 
cggaacagtg tggaagcaga aggctttttt aactcatccg tttgccaatc attgcaaaca 2040 
actgaaatgt ggatgtgatt gcctcaataa agctcgtccc cattgcttaa aaaaaaaaa 2099 

<210> 159 
<211> 291 
<212> PRT 

<213> Homo sapiens 
<400> 159 

Met Asp Trp Gly Thr Leu His Thr Phe He Gly Gly Val Asn Lys His 

1 5 10 15 

Ser Thr Ser He Gly Lys Val Trp He Thr Val He Phe He Phe Arg 

20 25 30 

Val Met He Leu Val Val Ala Ala Gin Glu Val Trp Gly Asp Glu Gin 

35 40 45 

Glu Asp Phe Val Cys Asn Thr Leu Gin Pro Gly Cys Lys Asn Val Cys 

50 55 60 

Tyr Asp His Phe Phe Pro Val Ser His He Arg Leu Trp Ala Leu Gin 
65 70 75 80 

Leu He Phe Val Ser Thr Pro Ala Leu Leu Val Ala Met His Val Ala 

85 90 95 

Tyr Tyr Arg His Glu Thr Thr Arg Lys Phe Arg Arg Gly Glu Lys Arg 

100 105 " HO 

Asn Asp Phe Lys Asp He Glu Asp He Lys Lys Gin Lys Val Arg He 

115 120 125 

Glu Gly Ser Leu Trp Trp Thr Tyr Thr Ser Ser He Phe Phe Arg He 

130 135 140 

He Phe Glu Ala Ala Phe Met Tyr Val Phe Tyr Phe Leu Tyr Asn Gly 
145 150 155, J 160 

Tyr His Leu Pro Trp Val Leu Lys Cys Gly He Asp Pro Cys Pro Asn 

165 170 175 

Leu Val Asp Cys Phe He Ser Arg Pro Thr Glu Lys Thr Val Phe Thr 

180 185 190 

He Phe Met He Ser Ala Ser Val He Cys Met Leu Leu Asn Val Ala 

195 200 205 

Glu Leu Cys Tyr Leu Leu Leu Lys Val Cys Phe Arg Arg Ser Lys Arg 

210 215 220 

Ala Gin Thr Gin Lys Asn His Pro Asn His Ala Leu Lys Glu Ser Lys 
225 230 235 ' 240 

Gin Asn Glu Met Asn Glu Leu He Ser Asp Ser Gly Gin Asn Ala He 

245 250 255 

Thr Gly Ser Gin Ala Lys His Phe Lys Val Lys Cys Ser Cys Val He 

260 265 270 

Arg Arg Leu Leu Ser Ser Pro Glu Gly Asn Thr Asn Leu Lys Val Pro 

275 280 285 

Ser Val Ala 
290 



<210> 160 
<211> 3951 
<212> DNA 
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<213> Homo sapiens 
X400> 160 

tctgcatcca tattgaaaac ctgacacaat 
gaggcttctc tacaacatga cccaaaggag 
tgtgactctc ctggttgcct taagttcaga 
tcaagacaat gggtataatg gattgctcat 
gaacctcatc tcaaacatta aggaaatgat 
taccaagaga agagtatttt tcagaaatat 
/ taataataac agcaaaataa aacaagaatc 
ctggtatggg gcacatggag atgatccata 
gggaaaatac attcatttca cacctaattt 
cggatcacga ggccgagtgt ttgtccatga 
tgagtataac aatgacaaac ctttctacat 
gtgttcatct gacatcacag gcatttttgt 
ctgtattatt agtaagcttt ttaaagaagg 
tgcaactgca tcaataatgt tcatgcaaag 
•aagtacccac aaccaagaag caccaaacct 
atgggatgta atcacagact ctgctgactt 
gcttccacct cctcccacat tctcgcttgt 
gctggatgtg tccagcaaga tggcagaggc 
agaattttat ttgatgcaga ttgttgaaat 
cagcaaagga gagatcagag cccagctaca 
gctggtttca tatctgccca ccactgtatc 
gcttaagaaa ggatttgagg tggttgaaaa 
gatattagtg accagcggag atgataagct 
cagtggttca acaattcact ccattgccct 
attatcacgt cttacaggag gtttaaagtt 
catgattgat gctttcagta gaatttcctc 
tcagcttgaa agtacaggtg aaaatgtcaa 
tgtggataat actgtgggca acgacactat 
tcctgagatt atattatttg atcctgatgg 
caatctaact tttcggacag ctagtctttg 
gacttacacc ctgaacaata cccatcattc 
tcgcgcctcc aactcagctg tgcccccagc 
cctccatttt cctcatcctg tgatgattta 
tcttaatgcc actgtcactg ccacagttga 
actccttgat gatggagcag gtgctgatgt 
ttttttctcc tttgctgcaa atggtagata 
cagcataagc accccagccc actctattcc 
cacagcaaac ggtaatattc agatgaatgc 
ggagcgaaag tggggcttta gccgagtcag 
tccagctggc ccccaccctg atgtgtttcc 
aaaagtagaa gaggaattga ccctatcttg 
ccaggctaca agctatgaaa taagaatgag 
taacaatgct attttagtaa atacatcaaa 
gatatttacg ttctcacccc aaatttccac 
aacacatgaa agccacagaa tttatgttgc 
gtctgctgta tctaacattg cccaggcgcc 
acctgccaga gattatctta tattgaaagg 
catttgcctt attatagttg tgacacatca 
gaaagagaat ggaacaaaat tattataaat 
taagacccat ggccttcgac tacaaaaaca 
gtattaaaat gcattgagtt tttgtacaat 
aaattctttt tgggggtaga ttagaaaacc 
aattattctt taaagtaatg tctttaaagg 
aaggaaagtt tgttttattg aggtggaaaa 
tctgcattat aactgtctgt gtgaagcaat 



gtatgcagca ggctcagtgt gagtgaactg 60 
cattgcaggt cctatttgca acctgaagtt 120 
actcccattc ctgggagctg gagtacagct 180 
tgcaattaat cctcaggtac ctgagaatca 240 
aactgaagct tcattttacc tatttaatgc 300 
aaagatttta atacctgcca catggaaagc 360 
atatgaaaag gcaaatgtca tagtgactga 420 
caccctacaa tacagagggt gtggaaaaga 480 
cctactgaat gataacttaa cagctggcta 540 
atgggcccac ctccgttggg gtgtgttcga 600 
aaatgggcaa aatcaaatta aagtgacaag 660 
gtgtgaaaaa ggtccttgcc cccaagaaaa 720 
atgcaccttt atctacaata gcacccaaaa 780 
tttatcttct gtggttgaat tttgtaatgc 840 
acagaaccag atgtgcagcc tcagaagtgc 900 
tcaccacagc tttcccatga acgggactga 960 
agaggctggt gacaaagtgg tctgtttagt 1020 
tgacagactc cttcaactac aacaagccgc 1080 
tcataccttc gtgggcattg ccagtttcga 1140 
ccaaattaac agcaatgatg atcgaaagtt 1200 
agctaaaaca gacatcagca tttgttcagg 1260 
actgaatgga aaagcttatg gctctgtgat 1320 
tcttggcaat tgcttaccca ctgtgctcag 1380 
gggttcatct gcagccccaa atctggagga 1440 
ctttgttcca gatatatcaa actccaatag 1500 
tggaactgga gacattttcc agcaacatat 1560 
acctcaccat caattgaaaa acacagtgac 1620 
gtttctagtt acgtggcagg ccagtggtcc 1680 
acgaaaatac tacacaaata attttatcac 1740 
gattccagga acagctaagc ctgggcactg 1800 
tctgcaagcc ctgaaagtga cagtgacctc 1860 
cactgtggaa gcctttgtgg aaagagacag 1920 
tgccaatgtg aaacagggat tttatcccat 1980 
gccagagact ggagatcctg ttacgctgag 2040 
tataaaaaat gatggaattt actcgaggta 2100 
tagcttgaaa gtgcatgtca atcactctcc 2160 
agggagtcat gctatgtatg taccaggtta 2220 
tccaaggaaa tcagtaggca gaaatgagga 2280 
ctcaggaggc tccttttcag tgctgggagt 2340 
accatgcaaa attattgacc tggaagctgt 2400 
gacagcacct ggagaagact ttgatcaggg 2460 
taaaagtcta cagaatatcc aagatgactt 2520 
gcgaaatcct cagcaagctg gcatcaggga 2580 
gaatggacct gaacatcagc caaatggaga 2640 
aatacgagca atggatagga actccttaca 2700 
tctgtttatt ccccccaatt ctgatcctgt 2760 
agttttaaca gcaatgggtt tgataggaat 2820 
tactttaagc aggaaaaaga gagcagacaa 2880 
aaatatccaa agtgtcttcc ttcttagata 2940 
tactaacaaa gtcaaattaa catcaaaact 3000 
acagataaga tttttacatg gtagatcaac 3060 
cttacacttt ggctatgaac aaataataaa 3120 
caaagggaag ggtaaagtcg gaccagtgtc 3180 
atagccccaa gcagagaaaa ggagggtagg 3240 
catttagtta ctttgattaa tttttctttt 3300 
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ctccttatct glgcagaaca ggttgcttgt ttacaactga agatcatgct atatttcata 3360 
tatgaagccc ctaatgcaaa gctctttacc tcttgctatt ttgttatata tattacagat 3420 
.gaaatctcac tgctaatgct cagagatctt ttttcactgt aagaggtaac ctttaacaat 3480 
atgggtatta cctttgtctc ttcataccgg ttttatgaca aaggtctatt gaatttattt 3540 
gtttgtaagt ttctactccc atcaaagcag ctttttaagt tattgccttg gttattatgg 3600 
atgatagtta tagcccttat aatgccttaa ctaaggaaga aaagatgtta ttctgagttt 3660 
gttttaatac atatatgaac atatagtttt attcaattaa accaaagaag aggtcagcag 3720 
ggagatacta acctttggaa atgattagct ggctctgttt tttggttaaa taagagtctt 3780 
taatcctttc tccatcaaga gttacttacc aagggcaggg gaagggggat atagaggtcc 3840 
caaggaaata aaaatcatct ttcatcttta attttactcc ttcctcttat ttttttaaaa 3900 
gattatcgaa caataaaatc atttgccttt ttaattaaaa acataaaaaa a 3951 

<210> 161 
<211> 943 
<212> PRT 
<213> Homo sapiens 

<400> 161 

Met Thr Gin Arg Ser lie Ala Gly Pro lie Cys Asn Leu Lys Phe Val 

mi t 5 10 15 

Thr Leu Leu Val Ala Leu Ser Ser Glu Leu Pro Phe Leu Glv Ala Glv 

20 25 30 

Val Gin Leu Gin Asp Asn Gly Tyr Asn Gly Leu Leu He Ala He Asn 

35 40 45 

Pro Gin Val Pro Glu Asn Gin Asn Leu He Ser Asn He Lys Glu Met 

50 55 60 

He Thr Glu Ala Ser Phe Tyr Leu Phe Asn Ala Thr Lys Arg Arq Val 
«5 70 75 80 

Phe Phe Arg Asn He Lys He Leu He Pro Ala Thr Trp Lys Ala Asn 

85 90 95 

Asn Asn Ser Lys He Lys Gin Glu Ser Tyr Glu Lys Ala Asn Val He 

100 105 . no 

Val Thr Asp Trp Tyr Gly Ala His Gly Asp Asp Pro Tyr Thr Leu Gin 

115 120 125 

Tyr Arg Gly Cys Gly Lys Glu Gly Lys Tyr He His Phe Thr Pro Asn 

130 135 ~ ' 140 

Phe Leu Leu Asn Asp Asn Leu Thr Ala Gly Tyr Gly Ser Arg Gly Arq 
I 45 150 155 160 

Val Phe Val His Glu Trp Ala His Leu Arg Trp Gly Val Phe Asp Glu 

165 170 : 175 

Tyr Asn Asn Asp Lys Pro Phe Tyr He Asn Gly Gin Asn Gin He Lys 

180 185 190 

Val Thr Arg Cys Ser Ser Asp He Thr Gly He Phe Val Cys Glu Lys 

I 95 200 205 

Gly Pro Cys Pro Gin Glu Asn Cys He He Ser Lys Leu Phe Lys Glu 

210 215 220 

Gly Cys Thr Phe He Tyr Asn Ser Thr Gin Asn Ala Thr Ala Ser He 
225 230 235 240 

Met Phe Met Gin Ser Leu Ser Ser Val Val Glu Phe Cys Asn Ala Ser 

245 250 ' 255 

Thr His Asn Gin Glu Ala Pro Asn Leu Gin Asn Gin Met Cys Ser Leu 

260 265 270 ' 

Arg ser Ala Trp Asp Val He Thr Asp Ser Ala Asp Phe His His Ser 

275 280 285 

Phe Pro Met Asn Gly Thr Glu Leu Pro Pro Pro Pro Thr Phe Ser Leu 

290 295 300 

Val Glu Ala Gly Asp Lys Val Val Cys Leu Val Leu Asp Val Ser Ser 
305 3 10 315 320 
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Lys Met 


Ala dlu Ala 


Asp 


Arg 


Leu 




325 








^Phe Tyr 


Leu Met Gin 


He 


Val 


Glu 




340 








Ser Phe 


Asp Ser Lys 


Gly 


Glu 


He 




355 






360 


Ser Asn 


Asp Asp Arg 


Lys 


Leu 


Leu 


370 






375 




Ser Ala 


Lys Thr Asp 


He 


Ser 


He 


385 




390 






Glu Val 


Val Glu Lys 


Leu 


Asn 


Gly 




405 








Leu Val 


Thr Ser Gly 


Asp 


Asp 


Lys 




420 








Val Leu 


Ser Ser Gly 


Ser 


Thr 


He 




435 






440 


Ala Ala 


Pro Asn Leu 


Glu 


Glu 


Leu 


450 






455 




Phe Phe 


Val Pro Asp 


He 


Ser 


Asn 


465 




470 






Ser Arg 


He Ser Ser 


Gly 


Thr 


Gly 




485 








Leu Glu 


Ser Thr Gly 


Glu 


Asn 


Val 




500 








Thr Val 


Thr Val Asp 


Asn 


Thr 


Val 




515 






520 


Thr Trp 


Gin Ala Ser 


Gly 


Pro 


Pro 


530 






535 




Gly Arg 


Lys Tyr Tyr 


Thr 


Asn 


Asn 


545 




550 






Thr Ala 


Ser Leu Trp 


He 


Pro 


Gly 




565 








Tyr Thr 


Leu Asn Asn 


Thr 


His 


His 




580 








Val Thr 


Ser Arg Ala 


Ser 


Asn 


Ser 




595 






600 


Ala Phe 


Val Glu Arg 


Asp 


Ser 


Leu 


610 






615 




Tyr Ala 


Asn Val Lys 


Gin 


Gly 


Phe 


625 




630 






Thr Ala 


Thr Val Glu 


Pro 


Glu 


Thr 




645 








Leu Asp 


Asp Gly Ala 


Gly 


Ala 


Asp 




660 








Ser Arg 


Tyr Phe Phe 


Ser 


Phe 


Ala 




675 






680 


Val His 


Val Asn His 


Ser 


Pro 


Ser 


690 






695 




Pro Gly 


Ser His Ala 


Met 


Tyr 


Val 


705 




710 






lie Gin 


Met Asn Ala 


Pro 


Arg 


Lys 




725 








Arg Lys 


Trp Gly Phe 


Ser 


Arg 


Val 




740 








Leu Gly 


Val Pro Ala 


Gly 


Pro 


His 




755 






760 


He He 


Asp Leu Glu 


Ala 


Val 


Lys 


770 






775 





Leu 


Gin Leu Gin 


Gin Ala Ala Glu 




330 


335 


He 


His Thr Phe Val Gly He Ala 


345 




350 


Arg 


Ala Gin Leu 


His Gin He Asn 






365 


Val 


Ser Tyr Leu Pro Thr Thr Val 




380 




Cys 


Ser Gly Leu Lys Lys Gly Phe 




395 


400 


Lys 


Ala Tyr Gly 


Ser Val Met He 




410 


415 


Leu 


Leu Gly Asn 


Cys Leu Pro Thr 


425 




430 


His 


Ser He Ala 


Leu Gly Ser Ser 


• 




445 


Ser 


Arg Leu Thr 


Gly Gly Leu Lys 




460 




Ser 


Asn Ser Met 


He Asp Ala Phe 




475 


480 


Asp 


He Phe Gin 


Gin His He Gin 




490 


495 


Lys 


Pro His His 


Gin Leu Lys Asn 


505 




510 


Gly 


Asn Asp Thr 


Met Phe Leu Val 






525 


Glu 


He He Leu 


Phe Asp Pro Asp 




540 




Phe 


He Thr Asn 


Leu Thr Phe Arg 




555 


560 


Thr 


Ala Lys Pro 


Gly His Trp Thr 




570 


575 


Ser 


Leu Gin Ala 


Leu Lys Val Thr 


585 




590 


Ala 


Val Pro Pro 


Ala Thr Val Glu 






605 


His 


Phe Pro His 


Pro Val Met He 




620 




Tyr 


Pro He Leu 


Asn Ala Thr Val 


635 


640 


Gly 


Asp Pro Val 


Thr Leu Arg Leu 




650 


655 


Val 


He Lys Asn 


Asp Gly He Tyr 


665 




670 


Ala 


Asn Gly Arg 


Tyr Ser Leu Lys 






685 


He 


Ser Thr Pro 


Ala His Ser He 




700 




Pro 


Gly Tyr Thr 


Ala Asn Gly Asn 




715 


720 


Ser 


Val Gly Arg 


Asn Glu Glu Glu 




730 


' 735 


Ser 


Ser Gly Gly 


Ser Phe Ser Val 


745 




750 


Pro 


Asp Val Phe 


Pro Pro Cys Lys 






765 


Val 


Glu Glu Glu 


Leu Thr Leu Ser 




780 
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79 



Trp Thr Ala Pro Gly Glu Asp Phe Asp Gin Gly Gin Ala Thr Ser Tyr 
785 790 795 800 

x Glu He Arg Met Ser Lys Ser Leu Gin Asn He Gin Asp Asp Phe Asn 

805 810 815 

Asn Ala He Leu Val Asn Thr Ser Lys Arg Asn Pro Gin Gin Ala Gly 

820 825 830 

He Arg Glu He Phe Thr Phe Ser Pro Gin He Ser Thr Asn Gly Pro 

835 840 845 

Glu His Gin Pro Asn Gly Glu Thr His Glu Ser His Arg He Tyr Val 

850 855 860 

Ala He Arg Ala Met Asp Arg Asn Ser Leu Gin Ser Ala Val Ser Asn 
865 870 875 880 

He Ala Gin Ala Pro Leu Phe He Pro Pro Asn Ser Asp Pro Val Pro 

885 890 895 

Ala Arg Asp Tyr Leu He Leu Lys Gly Val Leu Thr Ala Met Gly Leu 

900 905 910 

He Gly He He Cys Leu He He Val Val Thr His His Thr Leu Ser 

915 920 925 

Arg Lys Lys Arg Ala Asp Lys Lys Glu Asn Gly Thr Lys Leu Leu 
930 935 940 



<210> 162 
<211> 498 
<212> DNA 
<213> Homo sapiens 

<400> 162 

tggagaacca cgtggacagc accatgaaca tgttgggcgg gggaggcagt gctggccgga 60 
agcccctcaa gtcgggtatg aaggagctgg ccgtgttccg ggagaaggtc actgagcagc 120 
accggcagat gggcaagggt ggcaagcatc accttggcct ggaggagccc aagaagctgc 180 
gaccaccccc tgccaggact ccctgccaac aggaactgga ccaggtcctg gagcggatct 240 
ccaccatgcg ccttccggat gagcggggcc ctctggagca cctctactcc ctgcacatcc 300 
ccaactgtga caagcatggc ctgtacaacc tcaaacagtg gcaagatgtc tctgaacggg 360 
cagcgtgggg agtgctggtg tgtgaacccc aacaccggga* agctgatcca gggagccccc 420 
accatccggg gggaccccga gtgtcatctc ttctacaatg agcagcagga ggctcgcggg 480 
gtgcacaccc cagcggat 49 g 

<210> 163 

<211> 1128 

<212> DNA 

<213> Homo sapiens 

<400> 163 

gccacctggc cctcctgatc gacgacacac gcacttgaaa cttgttctca gggtgtgtgg 60 
aatcaacttt ccggaagcaa ccagcccacc agaggaggtc ccgagcgcga gcggagacga 120 
tgcagcggag actggttcag cagtggagcg tcgcggtgtt cctgctgagc tacgcggtgc 180 
cctcctgcgg gcgctcggtg gagggtctca gccgccgcct caaaagagct gtgtctgaac 240 
atcagctcct ccatgacaag gggaagtcca tccaagattt acggcgacga ttcttccttc 300 
accatctgat cgcagaaatc cacacagctg aaatcagagc tacctcggag gtgtccccta 360 
actccaagcc ctctcccaac acaaagaacc accccgtccg atttgggtct gatgatgagg 420 
gcagatacct aactcaggaa actaacaagg tggagacgta caaagagcag ccgctcaaga 480 
cacctgggaa gaaaaagaaa ggcaagcccg ggaaacgcaa ggagcaggaa aagaaaaaac 540 
ggcgaactcg ctctgcctgg ttagactctg gagtgactgg gagtgggcta gaaggggacc 600 
acctgtctga cacctccaca acgtcgctgg agctcgattc acggaggcat tgaaattttc 660 
agcagagacc ttccaaggac atattgcagg attctgtaat agtgaacata tggaaagtat 720 
tagaaatatt tattgtctgt aaatactgta aatgcattgg aataaaactg tctcccccat 780 
tgctctatga aactgcacat tggtcattgt gaatattttt ttttttgcca aggctaatcc 840 
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aattattatt atcacattta ccataattta 

tgtatcttgg tgctgctgaa tttctatatt 

tatcaagtat gttgataaat gacacaatga 

aatgcctaaa tataattatc caaattgatt 

ttaaatttgt aaagaatgtc taataaaata 

<210> 164 
<211> 1310 
<212> DNA 
' <213> Homo sapiens 



ttttgtccat tgatgtattt attttgtaaa 900 
ttttgtaaca taatgcactt tagatataca 960 
agtgtctcta ttttgtggtt gattttaatg 1020 
ttcctttgtg catgtaaaaa taacagtatt 1080 
taatctaatt acatcatg 1128 



<400> 164 

gggcctggtt cgcaaagaag ctgacttcag agggggaaac tttcttcttt taggaggcgg 60 
ttagccctgt tccacgaacc caggagaact gctggccaga ttaattagac attgctatgg 120 
gagacgtgta aacacactac ttatcattga tgcatatata aaaccatttt attttcgcta 180 
ttatttcaga ggaagcgcct ctgatttgtt tcttttttcc ctttttgctc tttctggctg 240 
tgtggtttgg agaaagcaca gttggagtag ccggttgcta aataagtccc ^agcgcgagc 300 
ggagacgatg cagcggagac tggttcagca gtggagcgtc gcggtgttcc tgctgagcta 360 
cgcggtgccc tcctgcgggc gctcggtgga gggtctcagc cgccgcctca aaagagctgt 420 
gtctgaacat cagctcctcc atgacaaggg gaagtccatc caagatttac ggcgacgatt 480 
cttccttcac catctgatcg cagaaatcca cacagctgaa atcagagcta cctcggaggt 540 
gtcccctaac tccaagccct ctcccaacac aaagaaccac cccgtccgat ttgggtctga 600 
tgatgagggc agatacctaa ctcaggaaac taacaaggtg gagacgtaca aagagcagcc 660 
gctcaagaca cctgggaaga aaaagaaagg caagcccggg aaacgcaagg agcaggaaaa 720 
gaaaaaacgg cgaactcgct ctgcctggtt agactctgga gtgactggga gtgggctaga 780 
aggggaccac ctgtctgaca cctccacaac gtcgctggag ctcgattcac ggaggcattg 84 0 
aaattttcag cagagacctt ccaaggacat attgcaggat tctgtaatag tgaacatatg 900 
gaaagtatta gaaatattta ttgtctgtaa atactgtaaa tgcattggaa taaaactgtc 960 
tcccccattg ctctatgaaa ctgcacattg gtcattgtga atattttttt ttttgccaag 1020 
gctaatccaa ttattattat cacatttacc ataatttatt ttgtccattg atgtatttat 1080 
tttgtaaatg tatcttggtg ctgctgaatt tctatatttt ttgtaacata atgcacttta 1140 
gatatacata tcaagtatgt tgataaatga cacaatgaag tgtctctatt ttgtggttga 1200 
ttttaatgaa tgcctaaata taattatcca aattgatttt cctttgtgcc cgtaaaaata 1260 
acagtatttt aaatttgtaa agaatgtcta ataaaatata atctaattac 1310 

<210> 165 
<211> 177 
<212> PRT 

<213> Homo sapiens 



<400> 165 



Met 


Gin 


Arg 


Arg 


Leu Val Gin Gin Trp Ser Val 


Ala 


Val 


Phe Leu Leu 


1 








5 10 






15 


Ser 


Tyr 


Ala 


Val 


Pro Ser Cys Gly Arg Ser Val 


Glu 


Gly 


Leu Ser Arg 








20 


25 






30 


Arg 


Leu 


Lys 


Arg 


Ala Val Ser Glu His Gin Leu 


Leu 


His 


Asp Lys Gly 






35 




40 




45 




Lys 


Ser 


lie 


Gin 


Asp Leu Arg Arg Arg Phe Phe 


Leu 


His 


His Leu lie 




50 






55 


60 






Ala 


Glu 


lie 


His 


Thr Ala Glu lie Arg Ala Thr 


Ser 


Glu 


Val Ser Pro 


65 








70 75 






80 


Asn 


ISer 


Lys 


Pro 


Ser Pro Asn Thr Lys Asn His 


Pro 


Val 


Arg Phe Gly 










85 90 






95 


Ser 


Asp 


Asp 


Glu 


Gly Arg Tyr Leu Thr Gin Glu 


Thr 


Asn 


Lys Val Glu 








100 


105 






110 


Thr 


Tyr 


Lys 


Glu 


Gin Pro Leu Lys Thr Pro Gly 


Lys 


Lys 


Lys Lys Gly 






115 




120 




125 




Lys 


Pro 


Gly 


Lys 


Arg Lys Glu Gin Glu Lys Lys 


Lys 


Arg 


Arg Thr Arg 
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130 135 140 

Ser Ala Trp Leu Asp Ser Gly Val Thr Gly Ser Gly Leu Glu Gly Asp 
;i45 150 155 160 

His Leu Ser Asp Thr Ser Thr Thr Ser Leu Glu Leu Asp Ser Arg Arg 
165 170 175 

His 



<210> 166 
<211> 177 
<212> PRT 
<213> Homo sapiens 

<400> 166 

Met Gin Arg Arg Leu Val Gin Gin Trp Ser Val Ala Val Phe Leu Leu 

1 5 10 15 

Ser Tyr Ala Val Pro Ser Cys Gly Arg Ser Val Glu Gly Leu Ser Arg 

20 25 30 

Arg Leu Lys Arg Ala Val Ser Glu His Gin Leu Leu His Asp Lys Gly 

35 40 45 

Lys Ser He Gin Asp Leu Arg Arg Arg Phe Phe Leu His His Leu He 

50 55 60 

Ala Glu He His Thr Ala Glu He Arg Ala Thr Ser Glu Val Ser Pro 
65 70 75 80 

Asn Ser Lys Pro Ser Pro Asn Thr Lys Asn His Pro Val Arg Phe Gly 

85 90 95 

Ser Asp Asp Glu Gly Arg Tyr Leu Thr Gin Glu Thr Asn Lys Val Glu 

100 105 HO 

Thr Tyr Lys Glu Gin Pro Leu Lys Thr Pro Gly Lys Lys Lys Lys Gly 

115 120 125 

Lys Pro Gly Lys Arg Lys Glu Gin Glu Lys Lys Lys Arg Arg Thr Arg 

130 135 140 

Ser Ala Trp Leu Asp Ser Gly Val Thr Gly Ser Gly Leu Glu Gly Asp 
145 150 15£ ~" 160 

His Leu Ser Asp Thr Ser Thr Thr Ser Leu Glu Leu Asp Ser Arg Arg 
165 170 175 

His 



<210> 167 

<211> 3362 

<212> DNA 

<213> Homo sapiens 

<400> 167 

cacaatgtat gcagcaggct cagtgtgagt 
aaggagcatt gcaggtccta tttgcaacct 
ttcagaactc ccattcctgg gagctggagt 
gctcattgca attaatcctc aggtacctga 
aatgataact gaagcttcat tttacctatt 
aaatataaag attttaatac ctgccacatg 
agaatcatat gaaaaggcaa atgtcatagt 
tccatacacc ctacaataca gagggtgtgg 
taatttccta ctgaatgata acttaacagc 
ccatgaatgg gcccacctcc gttggggtgt 
ctacataaat gggcaaaatc aaattaaagt 



gaactggagg cttctctaca acatgaccca 60 
gaagtttgtg actctcctgg ttgccttaag 120 
acagcttcaa gacaatgggt ataatggatt 180 
gaatcagaac ctcatctcaa a.cattaagga 240 
taatgctacc aagagaagag tatttttcag 300 
gaaagctaat aataacagca aaataaaaca 360 
gactgactgg tatggggcac atggagatga 420 
aaaagaggga aaatacattc atttcacacc 480 
tggctacgga tcacgaggcc gagtgtttgt 540 
gttcgatgag tataacaatg acaaaccttt 600 
gacaaggtgt tcatctgaca tcacaggcat 660 



WO 02/00174 



82 



PCT/US01/21065 



ttttgtgtgt gaaaaaggtc cttgccccca agaaaactgt attattagta agctttttaa 720 
agaaggatgc acctttatct acaatagcac ccaaaatgca actgcatcaa taatgttcat 780 
; gcaaagttta tcttctgtgg ttgaattttg taatgcaagt acccacaacc aagaagcacc 840 
aaacctacag aaccagatgt gcagcctcag aagtgcatgg gatgtaatca cagactctgc 900 
tgactttcac cacagctttc ccatgaacgg gactgagctt ccacctcctc ccacattctc 960 
gcttgtagag gctggtgaca aagtggtctg tttagtgctg gatgtgtcca gcaagatggc 1020 
agaggctgac agactccttc aactacaaca agccgcagaa ttttatttga tgcagattgt 1080 
tgaaattcat accttcgtgg gcattgccag tttcgacagc aaaggagaga tcagagccca 1140 
gctacaccaa attaacagca atgatgatcg aaagttgctg gtttcatatc tgcccaccac 1200 
t tgtatcagct aaaacagaca tcagcatttg ttcagggctt aagaaaggat ttgaggtggt 1260 
tgaaaaactg aatggaaaag cttatggctc tgtgatgata ttagtgacca gcggagatga 1320 
taagcttctt ggcaattgct tacccactgt gctcagcagt ggttcaacaa ttcactccat 1380 
tgccctgggt tcatctgcag ccccaaatct ggaggaatta tcacgtctta caggaggttt 1440 
aaagttcttt gttccagata tatcaaactc caatagcatg attgatgctt tcagtagaat 1500 
ttcctctgga actggagaca ttttccagca acatattcag cttgaaagta caggtgaaaa 1560 
tgtcaaacct caccatcaat tgaaaaacac agtgactgtg gataatactg tgggcaacga 1620 
cactatgttt ctagttacgt- ggcaggccag tggtcctcct gagattatat tatttgatcc 1680 
tgatggacga aaatactaca caaataattt tatcaccaat ctaacttttc ggacagctag 1740 
tctttggatt ccaggaacag ctaagcctgg gcactggact tacaccctga tgtgtttcca 1800 
ccatgcaaaa ttattgacct ggaagctgta aaagtagaag aggaattgac cctatcttgg 1860 
acagcacctg gagaagactt tgatcagggc caggctacaa gctatgaaat aagaatgagt 1920 
aaaagtctac agaatatcca agatgacttt aacaatgcta ttttagtaaa tacatcaaag 1980 
cgaaatcctc agcaagctgg catcagggag atatttacgt tctcacccca aatttccacg 2040 
aatggacctg aacatcagcc aaatggagaa acacatgaaa gccacagaat ttatgttgca 2100 
atacgagcaa tggataggaa ctccttacag tctgctgtat ctaacattgc ccaggcgcct 2160 
ctgtttattc cccccaattc tgatcctgta cctgccagag attatcttat attgaaagga 2220 
gttttaacag caatgggttt gataggaatc atttgcctta ttatagttgt gacacatcat 2280 
actttaagca ggaaaaagag agcagacaag aaagagaatg gaacaaaatt attataaata 2340 
aatatccaaa gtgtcttcct tcttagatat aagacccatg gccttcgact acaaaaacat 2400 
actaacaaag tcaaattaac atcaaaactg tattaaaatg cattgagttt ttgtacaata 2460 
cagataagat ttttacatgg tagatcaaca aattcttttt gggggtagat tagaaaaccc 2520 
ttacactttg gctatgaaca aataataaaa attattcttt aaagtaatgt ctttaaaggc 2580 
aaagggaagg gtaaagtcgg accagtgtca aggaaagttt gttttattga ggtggaaaaa 2640 
tagccccaag cagagaaaag gagggtaggt ctgcattata actgtctgtg tgaagcaatc 2700 
atttagttac tttgattaat ttttcttttc tccttatcto; tgcagaacag gttgcttgtt 2760 
tacaactgaa gatcatgcta tatttcatat atgaagcccc taatgcaaag ctctttacct 2820 
cttgctattt tgttatatat attacagatg aaatctcact gctaatgctc agagatcttt 2880 
tttcactgta agaggtaacc tttaacaata tgggtattac ctttgtctct tcataccggt 2940 
tttatgacaa aggtctattg aatttatttg tttgtaagtt tctactccca tcaaagcagc 3000 
tttctaagtt attgccttgg ttattatgga tgatagttat agcccttata atgccttaac 3060 
taaggaagaa aagatgttat tctgagtttg ttttaataca tatatgaaca tatagtttta 3120 
ttcaattaaa ccaaagaaga ggtcagcagg gagatactaa cctttggaaa tgattagctg 3180 
gctctgtttt ttggttaaat aagagtcttt aatcctttct ccatcaagag ttacttacca 3240 
agggcagggg aagggggata tagaggtcac aaggaaataa aaatcatctt tcatctttaa 3300 
ttttactcct tcctcttatt tttttaaaag attatcgaac aataaaatca tttgcctttt 3360 
tt 3362 



<210> 168 

<211> 2784 

<212> DNA 

<213> Homo sapiens 

<400> 168 

tctgcatcca tattgaaaac 
gaggcttctc tacaacatga 
tgtgactctc ctggttgcct 
tcaagacaat gggtataatg 
gaacctcatc tcaaacatta 



ctgacacaat gtatgcagca 
cccaaaggag cattgcaggt 
taagttcaga actcccattc 
gattgctcat tgcaattaat 
aggaaatgat aactgaagct 



ggctcagtgt gagtgaactg 60 
cctatttgca acctgaagtt 120 
ctgggagctg gagtacagct 180 
cctcaggtac ctgagaatca 240 
tcattttacc tatttaatgc 300 
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taccaagaga agagtatttt tcagaaatat aaagatttta atacctgcca catggaaagc 360 
taataataac agcaaaataa aacaagaatc atatgaaaag gcaaatgtca tagtgactga 420 
.ctggtatggg gcacatggag atgatccata caccctacaa tacagagggt gtggaaaaga 480 
gggaaaatac attcatttca cacctaattt cctactgaat gataacttaa cagctggcta 540 
cggatcacga ggccgagtgt ttgtccatga atgggcccac ctccgttggg gtgtgttcga 600 
tgagtataac aatgacaaac ctttctacat aaatgggcaa aatcaaatta aagtgacaag 660 
^Zt ? f S gacatcacag gcatttttgt gtgtgaaaaa ggtccttgcc cccaagaaaa 720 
ctgtattatt agtaagcttt ttaaagaagg atgcaccttt atctacaata gcacccaaaa 780 
tgcaactgca tcaataatgt tcatgcaaag tttatcttct gtggttgaat tttgtaatgc 840 
aagtacccac aaccaagaag caccaaacct acagaaccag atgtgcagcc tcagaagtgc 900 
atgggatgta atcacagact ctgctgactt tcaccacagc tttcccatga acgggactga 960 
gcttccacct cctcccacat tctcgcttgt agaggctggt gacaaagtgg tctgtttagt 1020 
gctggatgtg tccagcaaga tggcagaggc tgacagactc cttcaactac aacaagccgc 1080 
agaattttat ttgatgcaga ttgttgaaat tcataccttc gtgggcattg ccagtttcga 1140 
cagcaaagga gagatcagag cccagctaca ccaaattaac agcaatgatg atcgaaagtt 1200 
gctggtttca tatctgccca ccactgtatc agctaaaaca gacatcagca tttgttcagg 1260 
gcttaagaaa ggatttgagg tggttgaaaa actgaatgga aaagcttatg gctctgtgat 1320 
gatattagtg accagcggag atgataagct tcttggcaat tgcttaccca ctgtgctcag 1380 
cagtggttca acaattcact ccattgccct gggttcatct gcagccccaa atctggagga 1440 
attatcacgt cttacaggag gtttaaagtt ctttgttcca gatatatcaa actccaatag 1500 
catgattgat gctttcagta gaatttcctc tggaactgga gacattttcc agcaacatat 1560 
tcagcttgaa agtacaggtg aaaatgtcaa acctcaccat caattgaaaa acacagtgac 1620 
tgtggataat actgtgggca acgacactat gtttctagtt acgtggcagg ccagtggtcc 1680 
tcctgagatt atattatttg atcctgatgg acgaaaatac tacacaaata attttatcac 1740 
caatctaact tttcggacag ctagtctttg gattccagga acagctaagc ctgggcactg 1800 
gacttacacc ctgaacaata cccatcattc tctgcaagcc ctgaaagtga cagtgacctc 1860 
tcgcgcctcc aactcagctg tgcccccagc cactgtggaa gcctttgtgg aaagagacag 1920 
cctccatttt cctcatcctg tgatgattta tgccaatgtg aaacagggat tttatcccat 1980 
tcttaatgcc actgtcactg ccacagttga gccagagact ggagatcctg ttacgctgag 2040 
actccttgat gatggagcag gtgctgatgt tataaaaaat gatggaattt actcgaggta 2100 
ttttttctcc tttgctgcaa atggtagata tagcttgaaa gtgcatgtca atcactctcc 2160 
cagcataagc accccagccc actctattcc agggagtcat gctatgtatg taccaggtta 2220 
cacagcaaac ggtaatattc agatgaatgc tccaaggaaa tcagtaggca gaaatgagga 2280 
ggagcgaaag tggggcttta gccgagtcag ctcaggaggc tccttttcag tgctgggagt 2340 
tccagctggc ccccaccctg atgtgtttcc accatgcaaa attattgacc tggaagctgt 2400 
aaatagaaga ggaattgacc ctatcttgga cagcacctgg 1 agaagacttt gatcagggcc 2460 
aggctacaag ctatgaaata agaatgagta aaagtctaca gaatatccaa gatgacttta 2520 
acaatgctat tttagtaaat acatcaaagc gaaatcctca gcaagctggc atcagggaga 2580 
tatttacgtt ctcaccccaa atttccacga atggacctga acatcagcca aatggagaaa 2640 
cacatgaaag ccacagaatt tatgttgcaa tacgagcaat ggataggaac tccttacagt 2700 
ctgctgtatc taacattgcc caggcgcctc tgtttattcc ccccaattct gatcctgtac 2760 
ctgccagaga ttatcttata ttga 2784 

<210> 169 
<211> 592 
<212> PRT 

<213> Homo sapiens 
<400> 169 

Met Thr Gin Arg Ser He Ala Gly Pro He Cys Asn Leu Lys Phe Val 

15 10 15 

Thr Leu Leu Val Ala Leu Ser Ser Glu Leu Pro Phe Leu Gly Ala Glv 
20 25 30 • 

Val Gin Leu Gin Asp Asn Gly Tyr Asn Gly Leu Leu He Ala He Asn 

35 40 45 

Pro Gin Val Pro Glu Asn Gin Asn Leu He Ser Asn He Lys Glu Met 

50 55 60 

He Thr Glu Ala Ser Phe Tyr Leu Phe Asn Ala Thr Lys Arg Arg Val 
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65 

Phe Phe 

-\ 

Asn Asn 

Val Thr 

Tyr Arg 
130 
Phe Leu 
145 

Val Phe 

Tyr Asn 

Val Thr 

Gly Pro 
210 
Gly Cys 
225 

Met Phe 

Thr His 

Arg Ser 

Phe Pro 
290 
Val Glu 
305 

Lys Met 

Phe Tyr 

Ser Phe 

Ser Asn 
370 
Ser Ala 
385 

Glu Val 

Leu Val 

Val Leu 

Ala Ala 
450 
Phe Phe 
465 

Ser Arg 
Leu Glu 
Thr Val 
Thr Trp 



Arg Asn 

Ser Lys 
100 
Asp Trp 
115 

Gly Cys 

Leu Asn 

Val His 

Asn Asp 
180 
Arg Cys 
195 

Cys Pro 

Thr Phe 

Met Gin 

Asn Gin 
260 
Ala Trp 
275 

Met Asn 

Ala Gly 

Ala Glu 

Leu Met 
340 
Asp Ser 
355 

Asp Asp 

Lys Thr 

Val Glu 

Thr Ser 
420 
Ser Ser 
435 

Pro Asn 

Val : Pro 

He Ser 

Ser Thr 
500 
Thr Val 
515 

Gin Ala 



70 
He Lys 
85 

He Lys 

Tyr Gly 

Gly Lys 

Asp Asn 
150 
Glu Trp 
165 

Lys Pro 

Ser Ser 

Gin Glu 

He Tyr 
230 
Ser Leu 
245 

Glu Ala 

Asp Val 

Gly Thr 

Asp Lys 
310 
Ala Asp 
325 

Gin He 

Lys Gly 

Arg Lys 

Asp He 
390 
Lys Leu 
405 

Gly Asp 

Gly Ser 

Leu Glu 

Asp He 
470 
Ser Gly 
485 

Gly Glu 
Asp Asn 
Ser Gly 



He Leu 

Gin Glu 

Ala His 
120 
Glu Gly 
135 

Leu Thr 

Ala His 

Phe Tyr 

Asp He 
200 
Asn Cys 
215 

Asn Ser 

Ser Ser 

Pro Asn 

He Thr 
280 
Glu Leu 
295 

Val Val 

Arg Leu 

Val Glu 

Glu He 
360 
Leu Leu 
375 

Ser He 

Asn Gly 

Asp Lys 

Thr He 
440 
Glu Leu 
455 

Ser Asn 

Thr Gly 

Asn Val 

Thr Val 
520 
Pro Pro 



He Pro 

90 
Ser Tyr 
105 

Gly Asp 

Lys Tyr 

Ala Gly 

Leu Arg 
170 
He Asn 
185 

Thr Gly 

He He 

Thr Gin 

Val Val 
250 
Leu Gin 
265 

Asp Ser 

Pro Pro 

Cys Leu 

Leu Gin 
330 
He His 
345 

Arg Ala 

Val Ser 

Cys Ser 

Lys Ala 
410 
Leu Leu 
425 

His Ser 

Ser Arg 

Ser Asn 

Asp He 
490 
Lys Pro 
505 

Gly Asn 



75 

Ala Thr 

Glu Lys 

Asp Pro 

He His 
140 
Tyr Gly 
155 

Trp Gly 

Gly Gin 

He Phe 

Ser Lys 
220 
Asn Ala 
235 

Glu Phe 

Asn Gin 

Ala Asp 

Pro Pro 
300 
Val Leu 
315 

Leu Gin 

Thr Phe 

Glii Leu 

Tyr Leu 
380 
Gly Leu 
395 

Tyr Gly 

Gly Asn 

He Ala 

Leu Thr 
460 
Ser Met 
475 

Phe Gin 



His His 
Asp Thr 
Glu He He Leu 



80 

Trp Lys Ala Asn 
95 

Ala Asn Val He 
110 

Tyr Thr Leu Gin 
125 

Phe Thr Pro Asn 

Ser Arg Gly Arg 
160 

Val Phe Asp Glu 
175 

Asn Gin He Lys 
190 

Val Cys Glu Lys 
205 

Leu Phe Lys Glu 

Thr Ala Ser He 
240 

Cys Asn Ala Ser 
255 

Met Cys Ser Leu 
270 

Phe His His Ser 
285 

Thr Phe Ser Leu 

Asp Val Ser Ser 
320 

Gin Ala Ala Glu 
335 

Val Gly He Ala 
350 

His Gin He Asn 
365 

Pro Thr Thr Val 

Lys Lys Gly Phe 
400 

Ser Val Met He 
415 

Cys Leu Pro Thr 
430 

Leu Gly Ser Ser 
445 

Gly Gly Leu Lys 

He Asp Ala Phe 
480 

Gin His 1 He Gin 
495 

Gin Leu Lys Asn 
510 

Met Phe Leu Val 
525 

Phe Asp Pro Asp 
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530 535 540 

Gly Arg Lys Tyr Tyr Thr Asn Asn Phe He Thr Asn Leu Thr Phe Arg 
^545 .550 555 560 

Thr Ala Ser Leu Trp He Pro Gly Thr Ala Lys Pro Gly His Trp Thr 

565 570 575 

Tyr Thr Leu Met Cys Phe His His Ala Lys Leu Leu Thr Trp Lys Leu 
580 585 590 



/ <210> 170 
<211> 791 
<212> PRT 

<213> Homo sapiens 
<400> 170 

Met Thr Gin Arg Ser He Ala Gly Pro He Cys Asn Leu Lys Phe Val 

1 5 10 15 

Thr Leu Leu Val Ala Leu Ser Ser Glu Leu Pro Phe Leu Gly Ala Gly 

20 25 30 

Val Gin Leu Gin Asp Asn Gly Tyr Asn Gly Leu Leu He Ala He Asn 

35 40 45 

Pro Gin Val Pro Glu Asn Gin Asn Leu He Ser Asn He Lys Glu Met 

50 55 60 

He Thr Glu Ala Ser Phe Tyr Leu Phe Asn Ala Thr Lys Arg Arg Val 
65 70 75 80 

Phe Phe Arg Asn lie Lys He Leu He Pro Ala Thr Trp Lys Ala Asn 

85 90 95 

Asn Asn Ser Lys He Lys Gin Glu Ser Tyr Glu Lys Ala Asn Val He 

100 105 110 

Val Thr Asp Trp Tyr Gly Ala His Gly Asp Asp Pro Tyr Thr Leu Gin 

115 120 125 

Tyr Arg Gly Cys Gly Lys Glu Gly Lys Tyr He His Phe Thr Pro Asn 

130 135 140 

Phe Leu Leu Asn Asp Asn Leu Thr Ala Gly Tyr Gly Ser Arg Gly Arg 
145 150 155 160 

Val Phe Val His Glu Trp Ala His Leu Arg Trp Gly Val Phe Asp Glu 

165 170 175 

Tyr Asn Asn Asp Lys Pro Phe Tyr He Asn Gly Gin Asn Gin He Lys 

180 185 190 

Val Thr Arg Cys Ser Ser Asp He: Thr Gly He Phe Val Cys Glu Lys 

195 200 205 

Gly Pro Cys Pro Gin Glu Asn Cys He He Ser Lys Leu Phe Lys Glu 

210 215 220 

Gly Cys Thr Phe He Tyr Asn Ser Thr Gin Asn Ala Thr Ala Ser He 
225 230 235 240 

Met Phe Met Gin Ser Leu Ser Ser Val Val Glu Phe Cys Asn Ala Ser 

24 5 250 ~ 255 

Thr His Asn Gin Glu Ala Pro Asn Leu Gin Asn Gin Met Cys Ser Leu 

260 265 270 

Arg Ser Ala Trp Asp Val He Thr Asp Ser Ala Asp Phe His His Ser 

275 280 285 

Phe Pro Met Asn Gly Thr Glu Leu Pro Pro Pro Pro Thr Phe Ser Leu 

2 ^0 295 300 

Val Glu Ala Gly Asp Lys Val Val Cys Leu Val Leu Asp Val Ser Ser 
305 310 315 320 

Lys Met Ala Glu Ala Asp Arg Leu Leu Gin Leu Gin Gin Ala Ala Glu 

325 330 335 

Phe Tyr Leu Met Gin He Val Glu He His Thr Phe Val Gly He Ala 
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340 



Ser 


Phe 


Asp 


Ser 


L vs Gl v Glu He 






355 




360 




Asn 


Asp 


Asp 


Arcr TjVS Leu Leu 




370 






375 


Ser 


Ala 


Lys 


Thr 


Asd He Ser He 


385 








^90 


Glu 


Val 


Val 


Glu 


T,vq TvP>n Asn Glv 

XI y O XJvU /lOIl wxy 










405 


Leu 


Val 


Thr 


Ser 


flJl v A <?n Asn Lvs 

VjXy xiOJ^J Jiop j( & 












Val 
V ax 


T.on 

XlCLl 


OCX. 


Cor 

ocx 


Gl \r Cor Thr Tip 
vjxy ocx Xlix A-Lc 










440 

^± *i v 


Ala 


Ala 


Pro 


Asn 


Leu Glu Glu Leu 










4RR 


cue 


Pha 

t Uc 


V clX 


Pro 


A en Tie Car &cn 
-TVoXJ lie OCX noil 


1 OO 










oer 


Arg 


lie 


Ser 


Car f*l v r *Fl-tr C^l t p 

dcx oiy i nx vjiy 










4ft<-i 


Leu. 


ulU 


Ser 


*Phr 
XIX 17 


Gl \r f2l n & cn Val 
vsiy blU noil val 












Thr 

xixx 


Val 


X liX 


Va 1 
Val 


&cn 21 en TVi^* Val 
Aop noli lllx- Val 






DXD 




con 


lili 


xrp 


uxil 


/Via 


Cor /Til \f Pm Prn 
ocx "jxy xrx vj rx u 












Gl if 
biy 


Arg 


Lys 


Tyr 


•ptrr* Th r 21 en & an 
iyi 1 XXX ASH noli 












Thr 


Ala 


Ser 


Leu 


Tm Tie Prn Glv 

x x. l-j XJ-v lxu vjx y 












xyr 


Thr 


Leu 


Asn 


noxi xixx nxo nxo 








JOU 




Val 
v ai 


Thr 


Oar 
DQX 


Arg 


Zlla Car 21 cn Cot* 
nx ct ocx noil ocx 










600 


Ala 


xXXC 


Val 


VJX u 


21 rrt Asn Cot* T.on 
nx y no^/ ocx xic u 










OXJ 


lyr 


ai a 

nxa 


Asn 


Val 
Val 


iiys vjxh bxy xriic 


\j£.D 








n"3ft 


Thr 


Ala 


Thr 


Val 


Glu Pro Glu Thr 

\J-L LA XT X v V7XU X11L 










64 


_ 




Asp 


i*tiy 


Ala fZl tr 21 1 a A cr\ 
nxa crxy Ala nop 








DOU 




Ser 


Arg 


iyr 


Pho 

IT XlC 


pVia Cor Pha 21 1 a 
xr lie OCX rile AXd 










OOw 


Val 
Val 


Hi s 

XIX o 


Val 


A an 
noil 


Hi c Cor Prn Cot* 
11 XO OCX rxu OCX 












Pro 


Gly 


Ser 


His 


Ala Met Tyr Val 


705 








710 


He 


Gin 


Met 


Asn 


Ala Pro Arg Lys 










725 


Arg 


Lys 


Trp 


Gly 


Phe Ser Arg Val 








740 




Leu 


Gly 


Val 


Pro 


Ala Gly Pro His 






755 




760 


He 


He 


Asp 


Leu 


Glu Ala Val Asn 




770 






775 


Asp 


Ser 


Thr 


Trp 


Arg Arg Leu 


785 








790 



345 


350 






Arg Ala 


Gin Leu Hi s Gin 


He 


Asn 




365 






Val Ser 


Tvr Leu Prn Thr 

1 JJ- **CU ITXC 111X 


Thr 


Val 




380 






Cys Ser 


Glv Leu TiV«5 T.v*? 
vj_i__y aj^u xjyo xiyo 


Gly 


Phe 




395 




400 


Lvs Ala 

J W III <_1 


Tvr Glv Ser Val 


Met 


He 


410 




415 




Leu Leu 


Glv Asn Pvq Leu 
wxj /rton ^yo lieu 


Prn 
rx \j 


Thr 

X 1XX 


425 


430 






His Ser 

nxo ocx 


Tie Ala TiPii Glv 

x> ^ w nxci jjcu vjxv 


Ser 


Cor 
ocx 




445 






Cor Arrr 
ocx nx y 


T.oii Thr Gl v Gl v 
xjcu nix wxy vjx y 


Leu 


Lys 










Cor Asn 
ocx noil 


Ser Met" Tl e A<?n 
ocx xxc i«> xxe no^/ 


Ala 


Phe 




475 




480 


& cn T 1 o 
nop X lc 


Pho Gin Gin Hi c 
x lie uin vjxh nxo 


Tl o 
xxc 


Gl n 


4 QO 




4QR 




T. »»c prA 

xiyo it XX/ 


Hi c Hi q Gin T.oii 
xxx o nxo uin xicu 


Lys 


Asn 


DVD 


JlU 






fZ~] \r 21 cn 
Voxy noli 


Acr\ Thr Mot* Pho 
nop Xixx riet irixc 


T.on 


Val 
vax 










Gl n T 1 o 
bill lie 


Tlo T.on Pho 2Vcri 

xic xiclx xr xic nop 


Pro 


Asp 




Ran 






P*ho Tlo 
XtXXC xxc 


Thr A an T.on Thr 
x xix noil xtcui xixx 


Phe 


Arrr 
nx y 




««* «^>^ 




*J V V 


Thr Ala 
xixx nxa 


T.vc Prn Gl \r Hie 
xiyo rxu oxy nxo 


Trr» 


Thr 


570 




575 




Cor T.on 

OCX XICU 


Gl n Ala Leu Lvs 
vxii nx cx jjcu xiyo 


Val 


Thr 


585 


590 






Ala Val 


Prn Pro Al f\ Thr 

_ x. itjlv nxa xixx. 


Val 


Glu 




605 






His Phe 
xi-t-o rue 


Pro His Pro Val 

X X> KJ XI -X O ti.v V ClX 


Met 


He 




, 620 






Tyr Pro 


Tie Leu Asn Ala 


Thr 


Val 




635 




640 


Gl v Acri 
vjj. y no 


Pro Val Thr Leu 
rx v vax xixx ucu 


Arg 


Leu 


650 




655 




Val He 


T.wq Asn Aso Gl V 
jjy o noxi no ox y 


He 


Tvr 
xyi 


665 


670 






Ala Aon 
nxa noil 


Glv Arn Tvr Ser 
vj x y nx y x y x ocx 


Leu 


Tivc 
xiyo 




685 






He Ser 


Thr Pro Ala His 


Ser 


He 




700 






Pro Gly 


Tyr Thr Ala Asn 


Gly 


Asn 




715 




720 


Ser Val 


Gly Arg Asn Glu 


Glu 


Glu 


730 




735 




Ser Ser 


Gly Gly Ser Phe 


Ser 


Val 


745 


750 


i 




Pro Asp 


Val Phe Pro Pro 


Cys 


Lys 




765 






Arg Arg 


Gly He Asp Pro 


He 


Leu 



780 
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<210> 171 

,<211> 1491 

<212> DNA 

<213> Homo sapiens 



<400> 171 

cctcctgcca gccaagtgaa gacatgctta cttccccttc accttccttc atgatgtggg 60 
aagagtgctg caacccagcc ctagccaacg ccgcatgaga gggagtgtgc cgagggcttc 120 
tgagaaggtt tctctcacat ctagaaagaa gcgcttaaga tgtggcagcc cctcttcttc 180 
aagtggctct tgtcctgttg ccctgggagt tctcaaattg ctgcagcagc ctccacccag 240 
cctgaggatg acatcaatac acagaggaag aagagtcagg aaaagatgag agaagttaca 300 
gactctcctg ggcgaccccg agagcttacc attcctcaga cttcttcaca tggtgctaac 360 
agatttgttc ctaaaagtaa agctctagag gccgtcaaat tggcaataga agccgggttc 420 
caccatattg attctgcaca tgtttacaat aatgaggagc aggttggact ggccatccga 480 
agcaagattg cagatggcag tgtgaagaga gaagacatat tctacacttc aaagctttgg 540 
agcaattccc atcgaccaga gttggtccga ccagccttgg aaaggtcact gaaaaatctt 600 
caattggact atgttgacct ctatcttatt cattttccag tgtctgtaaa gccaggtgag 660 
gaagtgatcc caaaagatga aaatggaaaa atactatttg acacagtgga tctctgtgcc 720 
acatgggagg ccatggagaa gtgtaaagat gcaggattgg ccaagtccat cggggtgtcc 780 
aacttcaacc acaggctgct ggagatgatc ctcaacaagc cagggctcaa gtacaagcct 840 
gtctgcaacc aggtggaatg tcatccttac ttcaaccaga gaaaactgct ggatttctgc 900 
aagtcaaaag acattgttct ggttgcctat agtgctctgg gatcccatcg agaagaacca 960 
tgggtggacc cgaactccce ggtgctcttg gaggacccag tcctttgtgc cttggcaaaa 1020 
aagcacaagc gaaccccagc cctgattgcc ctgcgctacc agctgcagcg tggggttgtg 1080 
gtcctggcca agagctacaa tgagcagcgc atcagacaga acgtgcaggt gtttgaattc 1140 
cagttgactt cagaggagat gaaagccata gatggcctaa acagaaatgt gcgatatttg 1200 
acccttgata tttttgctgg cccccctaat tatccatttt ctgatgaata ttaacatgga 1260 
gggcattgca tgaggtctgc cagaaggccc tgcgtgtgga tggtgacaca gaggatggct 1320 
ctatgctggt gactggacac atcgcctctg gttaaatctc tcctgcttgg cgacttcagt 1380 
aagctacagc taagcccatc ggccggaaaa gaaagacaat aattttgttt ttcattttga 1440 
aaaaattaaa tgctctctcc taaagattct tcacctaaaa aaaaaaaaaa a 1491 

<210> 172 
<211> 364 

<212> PRT ' 
<213> Homo sapiens 

<400> 172 

Met Trp Gin Pro Leu Phe Phe Lys Trp Leu Leu Ser Cys Cys Pro Gly 

1 5 10 y is 

Ser Ser Gin He Ala Ala Ala Ala Ser Thr Gin Pro Glu Asp Asp He 

20 25 30 

Asn Thr Gin Arg Lys Lys Ser Gin Glu Lys Met Arg Glu Val Thr Asp 

35 40 45 

Ser Pro Gly Arg Pro Arg Glu Leu Thr He Pro Gin Thr Ser Ser His 

50 55 60 

Gly Ala Asn Arg Phe Val Pro Lys Ser Lys Ala Leu Glu Ala Val Lvs 
65 70 75 so 

Leu Ala He Glu Ala Gly Phe His His He Asp Ser Ala His Val Tyr 

85 90 95 

Asn Asn Glu Glu Gin Val Gly Leu Ala He Arg Ser Lys He Ala Asp 

100 105 HO 

Gly Ser Val Lys Arg Glu Asp He Phe Tyr Thr Ser Lys Leu Trp Ser 

115 120 125 

Asn Ser His Arg Pro Glu Leu Val Arg Pro Ala Leu Glu Arg Ser Leu 

130 135 140 

Lys Asn Leu Gin Leu Asp Tyr Val Asp Leu Tyr Leu He His Phe Pro 
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145 








150 


155 


160 


Val 


Ser 


Val 


Lys 


Pro Gly Glu Glu 


Val lie Pro 


Lys Asp Glu Asn Gly 


•5 








165 


170 


175 


Lys 


lie 


Leu 


Phe 


Asp Thr Val Asp 


Leu Cys Ala 


Thr Trp Glu Ala Met 








180 




185 


190 


Glu 


Lys 


Cys 


Lys 


Asp Ala Gly Leu 


Ala Lys Ser 


lie Gly Val Ser Asn 






195 




200 




205 


Phe 


Asn 


His 


Arg 


Leu Leu Glu Met 


lie Leu Asn 


Lys Pro Gly Leu Lys 




210 






215 




220 


Tyr 


Lys 


Pro 


Val 


Cys Asn Gin Val 


Glu Cys His 


Pro Tyr Phe Asn Gin 


225 








230 


235 


240 


Arg 


Lys 


Leu 


Leu 


Asp Phe Cys Lys 


Ser Lys Asp 


lie Val Leu Val Ala 










245 


250 


255 


Tyr 


Ser 


Ala 


Leu 


Gly Ser His Arg 


Glu Glu Pro 


Trp Val Asp Pro Asn 








260 




265 


270 


Ser 


Pro 


Val 


Leu 


Leu Glu Asp Pro 


Val Leu Cys 


Ala Leu Ala Lys Lys 






275 




280 




285 


His 


Lys 


Arg 


Thr 


Pro Ala Leu lie 


Ala Leu Arg 


Tyr Gin Leu Gin Arg 




290 






295 . 




300 


Gly 


Val 


Val 


Val 


Leu Ala Lys Ser 


Tyr Asn Glu 


Gin Arg lie Arg Gin 


305 








310 


315 


320 


Asn 


Val 


Gin 


Val 


Phe Glu Phe Gin 


Leu Thr Ser 


Glu Glu Met Lys Ala 










325 


330 


335 


lie 


Asp 


Gly 


Leu 


Asn Arg Asn Val 


Arg Tyr Leu 


Thr Leu Asp lie Phe 








340 




345 


350 


Ala 


Gly 


Pro 


Pro 


Asn Tyr Pro Phe 


Ser Asp Glu 


Tyr 






355 




360 







<210> 173 

<211> 1988 

<212> DNA 

<213> Homo sapiens 

<400> 173 

cgggagccgc ctccccgcgg 
tctctgctgt cgcccgtccc 
ccgcgccgcc cgtcaacatg 
tgcccctgct cctactcagc 
ggttgcagtc tagcgaccac 
gcggcggcag cgggtcctac 
gagcagcggc tgccatgctc 
ccttcttcgc cctctgtgga 
ttgccttggc tgctgtgttc 
agaccttcac ccttcatgcc 
ttgggtgggc agccacgatt 
actacgaaga tgaccttctg 
ttgggaatga atgtgggaga 
tttctccagg cgactttgaa 
aatgctaaaa taatttggga 
ttttattatg ttttgtgaag 
ccttatatct atccataaca 
acactttata aggtaaaaat 
tttccaaata gaatggactt 
aagttgttaa tgaccaaaca 
tcgaactatt taaggaaagc 
cattaatatc ctgaatcatt 
aggaaagtac tatttcatgg 



cctcttcgct tttgtggcgg 
gcgcgctcct ccgacccgct 
atccgctgcg gcctggcctg 
gccatcgcct tcgacatcat 
ggccagacgt cctcgctgtg 
gaggagggct gtcagagcct 
ttctgtggct tcatcatcct 
ccccagatgc ttgtcttcct 
cagatcatct ccctggtaat 
aaccctgctg tcacttacat 
atcctgatcg gctgtgcctt 
ggcaatgcca agcccaggta 
aaatcgctgc tgctgagatg 
cccatttttt ggcagtgttc 
gaaaatattt tttaagtagt 
ttgtgtcttt tcactaatta 
tttatactac atttgtaaga 
gaggtttcca agatttaata 
ggtctgttaa gggctaagga 
ttctaaaaga aatgcaaaaa 
aaaatcattt cctaaatgca 
catttcagct aaggcttcat 
tccaaacctg ttgccatagt 



cgcccgcgct cgcaggccac 60 
ccgctccgct ccgctcggcc 120 
cgagcgctgc cgctggatcc 180 
cgcgctggcc ggccgcggct 240 
gtggaaatgc tcccaagagg 300 
catggagtac gcgtggggta 360 
ggtgatctgt ttcatcctct 420 
gagagtgatt ggaggtctcc 480 
ttaccccgtg aagtacaccc 540 
ctataactgg gcctacggct 600 
cttcttctgc tgcctcccca 660 
cttctacaca tctgcctaac 720 
gactccagaa gaagaaactg 780 
atattattaa actagtcaaa 840 
gttatagttt catgtttatc 900 
cctatactat ^rccaatattt 960 
gaatatgcac gtgaaactta 1020 
atctgatcaa gttcttgtta 1080 
gaagaggaag ataaggttaa 1140 
aaaagtttat tttcaagcct 1200 
tatcatttgt gagaatttct 1260 
gttgactcga tatgtcatct 1320 
tggtaaggct ttcctttaag 1380 
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tgtgaaatat ttagatgaaa ttttctcttt taaagttctt tatagggtta gggtgtggga 1440 
.aaatgctata ttaataaatc tgtagtgttt tgtgtttata tgttcagaac cagagtagac 1500 
tggattgaaa gatggactgg gtctaattta tcatgactga tagatctggt taagttgtgt 1560 
agtaaagcat taggagggtc attcytgtca caaaagtgcc actaaaacag cctcaggaga 1620 
ataaatgact tgcttttcta aatctcaggt ttatctgggc tctatcatat agacaggctt 1680 
ctgatagttt gcarctgtaa gcagaaacct acatatagtt aaaatcctgg tctttcttgg 1740 
taaacagatt ttaaatgtct gatataaaac atgccacagg agaattcggg gatttgagtt 1800 
tctctgaata gcatatatat gatgcatcgg ataggtcatt atgatttttt accatttcga 1860 
cttacataat gaaaaccaat tcattttaaa tatcagatta ttattttgta agttgtggaa. i920 
aaagctaatt gtagttttca ttatgaagtt ttcccaataa accaggtatt ctaaaaaaaa 1980 
aaaaaaaa 1988 

<210> 174 
<211> 238 
<212> PRT 
<213> Homo sapiens 

<400> 174 

Gly Ala Ala Ser Pro Arg Pro Leu Arg Phe Cys Gly Gly Ala Arg Ala 

15 10 15 

Arg Arg Pro Leu Ser Ala Val Ala Arg Pro Ala Arg Ser Ser Asp Pro 

20 25 30 

Leu Arg Ser Ala Pro Leu Gly Pro Ala Pro Pro Val Asn Met He Arg 

35 40 45 

Cys Gly Leu Ala Cys Glu Arg Cys Arg Trp He Leu Pro Leu Leu Leu 

50 55 60 

Leu Ser Ala He Ala Phe Asp He He Ala Leu Ala Gly Arg Gly Trp 
65 70 75 80 

Leu Gin Ser Ser Asp His Gly Gin Thr Ser Ser Leu Trp Trp Lys Cys 

85 90 95 

Ser Gin Glu Gly Gly Gly Ser Gly Ser Tyr Glu Glu Gly Cys Gin Ser 

100 105 HO 

Leu Met Glu Tyr Ala Trp Gly Arg Ala Ala Ala Ala Met Leu Phe Cys 

115 120 125 

Gly Phe He He Leu Val lie Cys Phe He Leu Ser Phe Phe Ala Leu 

130 135 140 

Cys Gly Pro Gin Met Leu Val Phe Leu Arg Val He Gly Gly Leu Leu 
145 150 155 *~ 160 

Ala Leu Ala Ala Val Phe Gin He He Ser Leu Val lie Tyr Pro Val 

165 170 175 

Lys Tyr Thr Gin Thr Phe Thr Leu His Ala Asn Pro Ala Val Thr Tyr 

180 185 190 

lie Tyr Asn Trp Ala Tyr Gly Phe Gly Trp Ala Ala Thr He lie Leu 

195 200 205 

He Gly Cys Ala Phe Phe Phe Cys Cys Leu Pro Asn Tyr Glu Asp Asp 

210 215 220 

Leu Leu Gly Asn Ala Lys Pro Arg Tyr Phe Tyr Thr Ser Ala 
225 230 235 



<210> 175 , 

<211> 4181 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 3347, 3502 r 3506, 3520, 3538, 3549, 3646, 3940, 3968, 3974, 
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4036 r 4056, 4062, 4080, 4088, 4115 
<223> n = A,T,C or G 

<400> 175 

ggtggatgcg tttgggttgt agctaggctt tttcttttct ttctctttta aaacacatct 60 
agacaaggaa aaaacaagcc tcggatctga tttttcactc ctcgttcttg tgcttggttc 120 
ttactgtgtt tgtgtatttt aaaggcgaga agacgagggg aacaaaacca gctggatcca 180 
tccatcaccg tgggtggttt taatttttcg ttttttctcg ttattttttt ttaaacaacc 240 
actcttcaca atgaacaaac tgtatatcgg aaacctcagc gagaacgccg ccccctcgga 300 
cctagaaagt atcttcaagg acgccaagat cccggtgtcg ggacccttcc tggtgaagac 360 
tggctacgcg ttcgtggact gcccggacga gagctgggcc ctcaaggcca tcgaggcgct 420 
ttcaggtaaa atagaactgc acgggaaacc catagaagtt gagcactcgg tcccaaaaag 480 
gcaaaggatt cggaaacttc agatacgaaa tatcccgcct catttacagt gggaggtgct 540 
ggatagttta ctagtccagt atggagtggt ggagagctgt gagcaagtga acactgactc 600 
ggaaactgca gttgtaaatg taacctattc cagtaaggac caagctagac aagcactaga 660 
caaactgaat ggatttcagt tagagaattt caccttgaaa gtagcctata tccctgatga 720 
aatggccgcc cagcaaaacc ccttgcagca gccccgaggt cgccgggggc ttgggcagag 780 
gggctcctca aggcaggggt ctccaggatc cgtatccaag cagaaaccat gtgatttgcc 840 
tctgcgcctg ctggttccca cccaatttgt tggagccatc ataggaaaag aaggtgccac 900 
cattcggaac atcaccaaac agacccagtc taaaatcgat gtccaccgta aagaaaatgc 960 
gggggctgct gagaagtcga ttactatcct ctctactcct gaaggcacct ctgcggcttg 1020 
taagtctatt ctggagatta tgcataagga agctcaagat ataaaattca cagaagagat 1080 
ccccttgaag attttagctc ataataactt tgttggacgt cttattggta aagaaggaag 1140 
aaatcttaaa aaaattgagc aagacacaga cactaaaatc acgatatctc cattgcagga 1200 
attgacgctg tataatccag aacgcactat tacagttaaa ggcaatgttg agacatgtgc 1260 
caaagctgag gaggagatca tgaagaaaat cagggagtct tatgaaaatg atattgcttc 1320 
tatgaatctt caagcacatt taattcctgg attaaatctg aacgccttgg gtctgttccc 1380 
acccacttca gggatgccac ctcccacctc agggccccct tcagccatga ctcctcccta 1440 
cccgcagttt gagcaatcag aaacggagac tgttcatcag tttatcccag ctctatcagt 1500 
cggtgccatc atcggcaagc agggccagca catcaagcag ctttctcgct ttgctggagc 1560 
ttcaattaag attgctccag cggaagcacc agatgctaaa gtgaggatgg tgattatcac 1620 
tggaccacca gaggctcagt tcaaggctca gggaagaatt tatggaaaaa ttaaagaaga 1680 
aaactttgtt agtcctaaag aagaggtgaa acttgaagct catatcagag tgccatcctt 1740 
tgctgctggc agagttattg gaaaaggagg caaaacggtg aatgaacttc agaatttgtc 1800 
aagtgcagaa gttgttgtcc ctcgtgacca gacacctgat gagaatgacc aagtggttgt 1860 
caaaataact ggtcacttct atgcttgcca ggttgcccag agaaaaattc aggaaattct 1920 
gactcaggta aagcagcacc aacaacagaa ggctctgcaa agtggaccac ctcagtcaag 1980 
acggaagtaa aggctcagga aacagcccac cacagaggca gatgccaaac caaagacaga 2040 
ttgcttaacc aacagatggg cgctgacccc ctatccagaa tcacatgcac aagtttttac 2100 
ctagccagtt gtttctgagg accaggcaac ttttgaactc ctgtctctgt gagaatgtat 2160 
actttatgct ctctgaaatg tatgacaccc agctttaaaa caaacaaaca aacaaacaaa 2220 
aaaagggtgg gggagggagg gaaagagaag agctctgcac ttccctttgt tgtagtctca 2280 
cagtataaca gatattctaa ttcttcttaa tattccccca taatgccaga aattggctta 2340 
atgatgcttt cactaaattc atcaaataga ttgctcctaa atccaattgt taaaattgga 2400 
tcagaataat tatcacagga acttaaatgt taagccatta gcatagaaaa actgttctca 2460 
gttttatttt tacctaacac taacatgagt aacctaaggg aagtgctgaa tggtgttggc 2520 
aggggtatta aacgtgcatt tttactcaac tacctcaggt attcagtaat acaatgaaaa 2580 
gcaaaattgt tccttttttt tgaaaatttt atatacttta taatgataga agtccaaccg 2640 
ttttttaaaa aataaattta aaatttaaca gcaatcagct aacaggcaaa ttaagatttt 2700 
tacttctggc tggtgacagt aaagctggaa aattaatttc agggtttttt gaggcttttg 2760 
acacagttat tagttaaatc aaatgttcaa aaatacggag cagtgcctag tatctggaga 2820 
gcagcactac catttattct ttcatttata gttgggaaag tttttgacgg tactaacaaa 2880 
gtggtcgcag gagattttgg aacggctggt ttaaatggct tcaggagact tcagtttttt 2940 
gtttagctac atgattgaat gcataataaa tgctttgtgc ttctgactat caatacctaa 3000 
agaaagtgca tcagtgaaga gatgcaagac tttcaactga ctggcaaaaa gcaagcttta 3060 
gcttgtctta taggatgctt agtttgccac tacacttcag accaatggga cagtcataga 3120 
tggtgtgaca gtgtttaaac gcaacaaaag gctacatttc catggggcca gcactgtcat 3180 
gagcctcact aagctatttt gaagattttt aagcactgat aaattaaaaa aaaaaaaaaa 3240 
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aaattagact ccaccttaag tagtaaagta taacaggatt tctgtatact gtgcaatcag 3300 
ttctttgaaa aaaaagtcaa aagatagaga atacaagaaa agttttnggg atataatttg 3360 
.aatgactgtg aaaacatatg acctttgata acgaactcat ttgctcactc cttgacagca 3420 
aagcccagta cgtacaattg tgttgggtgt gggtggtctc caaggccacg ctgctctctg 3480 
aattgatttt ttgagttttg gnttgnaaga tgatcacagn catgttacac tgatcttnaa 3540 
ggacatatnt tataaccctt taaaaaaaaa atcccctgcc tcattcttat ttcgagatga 3600 
atttcgatac agactagatg tctttctgaa gatcaattag acattntgaa aatgatttaa 3660 
agtgttttcc ttaatgttct ctgaaaacaa gtttcttttg tagttttaac caaaaaagtg 3720 
ccctttttgt cactggtttc tcctagcatt catgattttt ttttcacaca atgaattaaa 3780 
attgctaaaa tcatggactg gctttctggt tggatttcag gtaagatgtg tttaaggcca 3840 
gagcttttct cagtatttga tttttttccc caatatttga ttttttaaaa atatacacat 3900 
aggagctgca tttaaaacct gctggtttaa attctgtcan atttcacttc tagcctttta 3960 
gtatggcnaa tcanaattta cttttactta agcatttgta atttggagta tctggtacta 4020 
gctaagaaat aattcnataa ttgagttttg tactcnccaa anatgggtca ttcctcatgn 4080 
ataatgtncc cccaatgcag cttcattttc caganacctt gacgcaggat aaattttttc 4140 
atcatttagg tccccaaaaa aaaaaaaaaa aaaaaaaaaa a 418I 

<210> 176 
<211> 579 
<212> PRT 

<213> Homo sapiens 
<400> 176 

Met Asn Lys Leu Tyr lie Gly Asn Leu Ser Glu Asn Ala Ala Pro Ser 

1 5 10 15 

Asp Leu Glu Ser He Phe Lys Asp Ala Lys He Pro Val Ser Gly Pro 

20 25 30 

Phe Leu Val Lys Thr Gly Tyr Ala Phe Val Asp Cys Pro Asp Glu Ser 

35 40 45 

Trp Ala Leu Lys Ala He Glu Ala Leu Ser Gly Lys He Glu Leu His 

50 55 60 

Gly Lys Pro He Glu Val Glu His Ser Val Pro Lys Arg Gin Arg He 
65 70 75 80 

Arg Lys Leu Gin He Arg Asn He Pro Pro His Leu Gin Trp Glu Val 

85 90 95 

Leu Asp Ser Leu Leu Val Gin Tyr Gly Val Val 1 Glu Ser Cys Glu Gin 

100 105 no 

Val Asn Thr Asp Ser Glu Thr Ala Val Val Asn Val Thr Tyr Ser Ser 

115 120 125 

Lys Asp Gin Ala Arg Gin Ala Leu Asp Lys Leu Asn Gly Phe Gin Leu 

130 135 140 

Glu Asn Phe Thr Leu Lys Val Ala Tyr He Pro Asp Glu Met Ala Ala 
145 150 155 160 

Gin Gin Asn Pro Leu Gin Gin Pro Arg Gly Arg Arg Gly Leu Gly Gin 

. 165 170 175 

Arg Gly Ser Ser Arg Gin Gly Ser Pro Gly Ser Val Ser Lys Gin Lys 

180 185 190 

Pro Cys Asp Leu Pro Leu Arg Leu Leu Val Pro Thr Gin Phe Val Gly 

195 200 205 

Ala He He Gly Lys Glu Gly Ala Thr He Arg Asn He Thr Lys Gin 

210 215 220 

Thr Gin Ser Lys lie Asp Val His Arg Lys Glu Asn Ala Gly Ala Ala 
225 230 235 ^ 240 

Glu Lys Ser He Thr He Leu Ser Thr Pro Glu Gly Thr Ser Ala Ala 

245 250 255 

Cys Lys Ser He Leu Glu He Met His Lys Glu Ala Gin Asp He Lys 

260 265 270 

Phe Thr Glu Glu Il e Pro Leu Lys He Leu Ala His Asn Asn Phe Val 
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275 


280 




285 




Gly 


Arg 


Leu lie Gly 


Lys Glu Gly 


Arg Asn 


Leu Lys Lys He 


Glu Gin 




290 




295 




300 




Asp 


Thr 


Asp Thr Lys 


He Thr He 


Ser Pro 


Leu Gin Glu Leu 


Thr Leu 


305 






310 




315 


320 


Tyr 


Asn 


Pro Glu Arg 


Thr He Thr 


Val Lys 


Gly Asn Val Glu 


Thr Cys 






325 




330 




335 


Ala 


Lys 


Ala Glu Glu 


Glu He Met 


Lys Lys 


He Arg Glu Ser 


Tyr Glu 






340 




345 


350 




Asn 


Asp 


lie Ala Ser 


Met Asn Leu 


Gin Ala 


His Leu He Pro 


Gly Leu 






355 


360 




365 




Asn 


Leu 


Asn Ala Leu 


Gly Leu Phe 


Pro Pro 


Thr Ser Gly Met 


Pro Pro 




370 




375 




380 




Pro 


Thr 


Ser Gly Pro 


Pro Ser Ala 


Met Thr 


Pro Pro Tyr Pro 


Gin Phe 


385 






390 




395 


400 


Glu 


Gin 


Ser Glu Thr 


Glu Thr Val 


His Gin 


Phe He Pro Ala 


Leu Ser 






405 




410 




415 


Val 


Gly 


Ala He He 


Gly Lys Gin 


Gly Gin 


His He Lys Gin 


Leu Ser 






420 




425 


430 




Arg 


Phe 


Ala Gly Ala 


Ser He Lys 


He Ala 


Pro Ala Glu Ala 


Pro Asp 






435 


440 




445 




Ala 


Lys 


Val Arg Met 


Val He He 


Thr Gly 


Pro Pro Glu Ala 


Gin Phe 




450 




455 




460 




Lys 


Ala 


Gin Gly Arg 


He Tyr Gly 


Lys He 


Lys Glu Glu Asn 


Phe Val 


465 






470 




475 


480 


Ser 


Pro 


Lys Glu' Glu 


Val Lys Leu 


Glu Ala 


His lie Arg Val 


Pro Ser 






485 




490 




495 


Pne 


Ala 


Ala Gly Arg 


Val lie Gly 


Lys Caly 


0»±y Lys lnr vax 


Asn Glu 






500 




505 


510 




Leu 


Gin 


Asn Leu Ser 


Ser Ala Glu 


Val Val 


Val Pro Arg Asp 


Gin Thr 






515 


520 




525 




Pro 


Asp 


Glu Asn Asp 


Gin Val Val 


Val Lys 


He Thr Gly His 


Phe Tyr 




530 




535 




540 




Ala 


Cys 


Gin Val Ala 


Gin Arg Lys 


He Gin 


Glu He Leu Thr 


Gin Val 


545 




550 




555 


560 


Lys 


Gin 


His Gin Gin 


Gin Lys Ala 


Leu Gin 


Ser Gly Pro Pro 


Gin Ser 






565 




570 




575 


Arg 


Arg 


Lys 











<210> 177 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<400> 177 

atgccccgta aatgtcttca gtgttcttca 
agatccaaac aaatacacat tctgtgtttt 
cacacagcaa aaaattgttt actttgttgg 
ggtgcttata aaaagttata aatatcgagt 
gaagtgagct tgtgcttagt atttacattg 
gcaaactggt gcagaaattc tataaactct 
attttgtttt gttttgtaaa aatgataaaa 

<210> 178 
<211> 561 
<212> DNA 



gggtagttgg gatctcaaaa gatttggttc 60 
agctcagtgt tttctaaaaa aagaaactgc 120 
acaaaccaaa tcagttctca aaaaatgacc 180 
agctctaaaa caaaccacct gaccaagagg 240 
gatgccagtt ttgtaatcac t!gacttatgt 300 
ttgctgtttt tgatacctgc tttttgtttc 360 
cttcagaaaa t 401 
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<213> Homo sapiens 
<<400> 178 

acgcctttca agggtgtacg caaagcactc attgataccc ttttggatgg ctatgaaaca 60 
gcccgctatg ggacaggggt ctttggccag aatgagtacc tacgctatca ggaggccctg 120 
agtgagctgg ccactgcggt taaagcacga attgggagct ctcagcgaca tcaccagtca 180 
gcagccaaag acctaactca gtcccctgag gtctccccaa caaccatcca ggtgacatac 240 
ctcccctcca gtcagaagag taaacgtgcc aagcacttcc ttgaattgaa gagctttaag 300 
gataactata acacattgga gagtactctg tgacggagct gaaggactct tgccgtagat 360 
taagccagtc agttgcaatg tgcaagacag gctgcttgcc gggccgccct cggaacatct 420 
ggcccagcag gcccagactg tatccatcca agttcccgtt gtatccagag ttcttagagc 480 
ttgtgtctaa agggtaattc cccaaccctt ccttatgagc atttttagaa cattggctaa 540 
gactattttc ccccagtagc g 5^ 

<210> 179 
<211> 521 
<212> DNA 
<213> Homo sapiens 

<400> 179 

cccaacgcgt ttgcaaatat tcccctggta gcctacttcc ttacccccga atattggtaa 60 
gatcgagcaa tggcttcagg acatgggttc tcttctcctg tgatcattca agtgctcact 120 
gcatgaagac tggcttgtct cagtgtttca acctcaccag ggctgtctct tggtccacac 180 
ctcgctccct gttagtgccg tatgacagcc cccatcaaat gaccttggcc aagtcacggt 240 
ttctctgtgg tcaaggttgg ttggctgatt ggtggaaagt agggtggacc aaaggaggcc 300 
acgtgagcag tcagcaccag ttctgcacca gcagcgcctc cgtcctagtg ggtgttcctg 360 
tttctcctgg ccctgggtgg gctagggcct gattcgggaa gatgcctttg cagggagggg 420 
aggataagtg ggatctacca attgattctg gcaaaacaat ttctaagatt tttttgcttt 480 
atgtgggaaa cagatctaaa tctcatttta tgctgtattt t " 521 

<210> 180 
<211> 417 
<212> DNA 
<213> Homo sapiens 

<400> 180 « 

ggtggaattc gccgaagatg gcggaggtgc aggtcctggt gcttgatggt cgaggccatc 60 
tcctgggccg cctggcggcc atcgtggcta aacaggtact gctgggccgg aaggtggtgg 120 
tcgtacgctg tgaaggcatc aacatttctg gcaatttcta cagaaacaag ttgaagtacc 180 
tggctttcct ccgcaagcgg atgaacacca acccttcccg aggcccctac cacttccggg 240 
cccccagccg catcttctgg cggaccgtgc gaggtatgct gccccacaaa accaagcgag 300 
gccaggccgc tctggaccgt ctcaaggtgt ttgacggcat cccaccgccc tacgacaaga 360 
aaaagcggat ggtggttcct gctgccctca aggtcgtgcg tctgaagcct acaagaa 417 

<210> 181 
<211> 283 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 35 < 
<223> n = A,T,C or G 

<400> 181 

gatttcttct aaataggatg taaaacttct ttcanattac tcttcctcag tcctgcctgc 60 
caagaactca agtgtaactg tgataaaata acctttccca ggtatattgg caggtatgtg 120 
tgtaatctca gaatacacag gtgacataga tatgatatga caactggtaa tggtggattc 180 
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atttacattg tttacacttc tatgaccagg ccttaaggga aggtcagttt tttaaaaaac 240 
caagtagtgt cttcctacct atctccagat acatgtcaaa aaa 283 

<210> 182 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<400> 182 

atattcttgc tgcttatgca gctgacattg ttgccctccc taaagcaacc aagtagcctt 60 
tatttcccac agtgaaagaa aacgctggcc tatcagttac attacaaaag gcagatttca 120 
agaggattga gtaagtagtt ggatggcttt cataaaaaca agaattcaag aagaggattc 180 
atgctttaag aaacatttgt tatacattcc tcacaaatta tacctgggat aaaaactatg 240 
tagcaggcag tgtgttttcc ttccatgtct ctctgcacta cctgcagtgt gtcctctgag 300 
gctgcaagtc tgtcctatct gaattcccag cagaagcact aagaagctcc accctatcac 360 
ctagcagata aaactatggg gaaaacttaa atctgtgcat a 401 

<210> 183 
<211> 366 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 325 

<223> n = A,T,C or G 
<400> 183 

accgtgtcca agtttttaga acccttgtta gccagaccga ggtgtcctgg tcaccgtttc 60 
accatcatgc tttgatgttc ccctgtcttt ctctcttctg ctctcaagag caaaggttaa 120 
tttaaggaca aagatgaagt cactgtaaac taatctgtca ttgtttttac cttccttttc 180 
tttttcagtg cagaaattaa aagtaagtat aaagcaccgt gattgggagt gtttttgcgt 240 
gtgtcggaat cactggtaaa tgttggctga gaacaatccc tccccttgca cttgtgaaaa 300 
cactttgagc gctttaagag attancctga gaaataatta aatatctttt ctcttcaaaa 360 
aaaaaa 366 

<210> 184 
<211> 370 
<212> DNA 
<213> Homo sapiens 



<400> 184 

tcttacttca aaagaaaaat aaacataaaa 
tttaataatt gtactgagag aaactgctta 
taaaatgtta gtctacatag atgggtgatt 
ttgcattcat gcttctgtgt acacataatg 
tcagtctgct ctgtttaatt ctgctgtctg 
cacagtttag tgatatctag gagtataaag 
ggtttaaaaa 



aataagttgc tggttcctaa caggaaaaat 60 
cgtacacatt gcagatcaaa tatttggagt 120 
gtaactttat tgccattaaa agatttcaaa 180 
aaaaatgggc aaataatgaa gatctctcct 240 
ctcttctcta atgctgcgtc cctaattgta 300 
ttgtcgccca tcaataaaaa tcacaaagtt 360 

. 370 



<210> 185 

<211> 107 1 

<212> DNA 

<213> Homo sapiens 

<400> 185 

ctcatattat tttccttttg agaaattgga aactctttct gttgctatta tattaataaa 60 
gttggtgttt attttctggt agtcaccttc cccatttaaa aaaaaaa 107 
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<210> 186 
^<2ll> 309 
<212> DNA 
<213> Homo sapiens 

<400> 186 

gaaaggatgg ctctggttgc cacagagctg ggacttcatg ttcttctaga gagggccaca 60 
agagggccac aggggtggcc gggagttgtc agctgatgcc tgctgagagg caggaattgt 120 

, gccagtgagt gacagtcatg agggagtgtc tcttcttggg gaggaaagaa ggtagagcct 180 

ttctgtctga atgaaaggcc aaggctacag tacagggccc cgccccagcc agggtgttaa 240 

tgcccacgta gtggaggcct ctggcagatc ctgcattcca aggtcactgg actgtacgtt 300 
tttatggtt 309 

<210> 187 
<211> 477 
<212> DNA 
<213> Homo sapiens 

<400> 187 

ttcagtccta gcaagaagcg agaattctga gatcctccag aaagtcgagc agcacccacc 60 
tccaacctcg ggccagtgtc ttcaggcttt actggggacc tgcgagctgg cctaatgtgg 120 
tggcctgcaa gccaggccat ccctgggcgc cacagacgag ctccgagcca ggtcaggctt 180 
cggaggccac aagctcagcc tcaggcccag gcactgattg tggcagaggg gccactaccc 240 
aaggtctagc taggcccaag acctagttac ccagacagtg agaagcccct ggaaggcaga 300 
aaagttggga gcatggcaga cagggaaggg aaacattttc agggaaaaga catgtatcac 360 
atgtcttcag aagcaagtca ggtttcatgt aaccgagtgt cctcttgcgt ' gtccaaaagt 420 
agcccagggc tgtagcacag gcttcacagt gattttgtgt tcagccgtga gtcacac 477 

<210> 188 
<211> 220 
<212> DNA 
<213> Homo sapiens 

<400> 188 

taaatatggt agatattaat attcctctta gatgaccagt gattccaatt gtcccaagtt 60 
ttaaataagt accctgtgag tatgagataa attagtgaca atcagaacaa gtttcagtat 120 
cagatgttca agaggaagtt gctattgcat tgattttaat atttgtacat aaacactgat 180 
ttttttgagc attattttgt atttgttgta ctttaatacc 220 

<210> 189 

<211> 417 ' 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 76, 77 

<223> n ~ A,T,C or G 

<400> 189 

accatcttga cagaggatac atgctcccaa aacgtttgtt accacactta aaaatcactg 60 
ccatcattaa gcatcnnttt caaaattata gccattcatg atttactttt tccagatgac 120 
tatcattatt ctagtccttt gaatttgtaa ggggaaaaaa aacaaaaaca aaaacttacg 180 
atgcactttt ctccagcaca tcagatttca aattgaaaat taaagacatg ctatggtaat 240 
gcacttgcta gtactacaca ctttgtacaa caaaaaacag aggcaagaaa caacggaaag 300 
agaaaagcct tcctttgttg gcccttaaac tgagtcaaga tctgaaatgt agagatgatc 360 
tctgacgata cctgtatgtt cttattgtgt aaataaaatt gctggtatga aatgaca 417 
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<210> 190 
;<211> 497 
<212> DNA 
<213> Homo sapiens 



<400> 190 

gcactgcggc gctctcccgt cccgcggtgg ttgctgctgc tgccgctgct gctgggcctg 60 
aacgcaggag ctgtcattga ctggcccaca gaggagggca aggaagtatg ggattatgtg 120 
acggtccgca aggatgccta catgttctgg tggctctatt atgccaccaa ctcctgcaag 180 
aacttctcag aactgcccct ggtcatgtgg cttcagggcg gtccaggcgg ttctagcact 240 
ggatttggaa actttgagga aattgggccc cttgacagtg atctcaaacc acggaaaacc 300 
acctggctcc aggctgccag tctcctattt gtggataatc ccgtgggcac tgggttcagt 360 
tatgtgaatg gtagtggtgc ctatgccaag gacctggcta tggtggcttc agacatgatg 420 
gttctcctga agaccttctt cagttgccac aaagaattcc agacagttcc attctacatt 480 
ttctcagagt cctatgg 497 

<210> 191 
<211> 175 
<212> DNA 
<213> Homo sapiens 

<400> 191 

atgttgaata ttttgcttat taactttgtt tattgtcttc tccctcgatt agaatattag 60 
ctacttgagt acaaggattt gagcctgtta cattcactgc tgaattttag gctcctggaa 120 
gatacccagc attcaataga gaccacacaa taaatatatg tcaaataaaa aaaaa 175 

<210> 192 
<211> 526 
<212> DNA 
<213> Homo sapiens 



<400> 192 

agtaaacatt attatttttt ttatatttgc 
aagaacagta ttgctgtaat tccttttctt 
attgaagaaa gagaaacttg tcaactcata 
ctatcactaa gtaatgtatc cttcagaatg 
tcacaaaatt aaagcaagaa gtccatagta 
tcagagtttc tgaggtcaaa ttttatcttt 
ttacttaatg tattttggtg tattttcctc 
aattcctctg atcactttga gaaacaaact 
ttttaaatat aaaaataaat attgttctga 

<210> 193 
<211> 553 
<212> DNA 
<213> Homo sapiens 



aaaggaaaca tatctaatcc ttcctataga 60 
ttcttcctca tttcctctgc cccttaaaag 120 
tccacgttat ctagcaaagt acataagaat 180 
tgttggttta ccagtgacac cccatattca 240 
atttatttgc taatagtgga tttttaatgc 300 
tcacttacaa gctctatgat cttaaataat 360 
aaattaatat tggtgttcaa gactatatct 420 
tttattaaat gtaaggcact tttctatgaa 480 
ttattactga aaaaaa 526 



<220> 

<221> misc_feature 
<222> 290, 300, 411, 441 
<223> n « A,T,C or G 



<400> 193 

tccattgtgg tggaattcgc tctctggtaa 
gctgggatga gccgtgctcc cggtggaagc 
cagtggtagc agttggactg accattgctg 
aagccatgaa gcatatggag cctcaagtaa 



aggcgtgcag gtgttggccg cggcctctga 60 

aagggagccc agccggagcc atggccagta 120 

ctgcaggatt tgcaggccgt tacgttttgc 180 

aacaagtttt tcaaagccta ccaaaatctg 240 
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ccttcagtgg tggctattat agaggtgggt ttgaacccaa aatgacaaan cgggaagcan 300 
cattaatact aggtgtaagc cctactgcca ataaagggaa aataagagat gctcatcgac 360 
'gaattatgct tttaaatcat cctgacaaag gaggatctcc ttatatagca nccaaaatca 420 
atgaagctaa agatttacta naaggtcaag ctaaaaaatg aagtaaatgt atgatgaatt 480 
ttaagttcgt attagtttat gtatatgagt actaagtttt tataataaaa tgcctcagag 540 
ctacaatttt aaa 553 

<210> 194 
<211> 320 
f <212> DNA 

<213> Homo sapiens 

<400> 194 

cccttcccaa tccatcagta aagaccccat ctgccttgtc catgccgttt cccaacaggg 60 

atgtcacttg atatgagaat ctcaaatctc aatgccttat aagcattcct tcctgtgtcc 120 

attaagactc tgataattgt ctcccctcca taggaatttc tcccaggaaa gaaatatatc 180 

cccatctccg tttcatatca gaactaccgt ccccgatatt cccttcagag agattaaaga 240 

ccagaaaaaa gtgagcctct tcatctgcac ctgtaatagt ttcagttcct attttcttcc 300 
attgacccat atttatacct 3 2 q 

<210> 195 
<211> 320 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 203, 218 
<223> n = A,T,C or G 

<400> 195 

aagcatgacc tggggaaatg gtcagacctt gtattgtgtt tttggccttg aaagtagcaa 60 
gtgaccagaa tctgccatgg caacaggctt taaaaaagac ccttaaaaag acactgtctc 120 

aactgtggtg ttagcaccag ccagctctct gtacatttgc tagcttgtag ttttctaaga 180 

ctgagtaaac ttcttatttt tanaaagggg aggctggntt gtaactttcc ttgtacttaa 240 

ttgggtaaaa gtcttttcca caaaccacca tctattttgt gaactttgtt agtcatcttt 300 
tatttggtaa attatgaact " ^20 

<210> 196 
<211> 357 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> 36 

<223> n = A, T, C or G 
<400> 196 

atataaaata atacgaaact ttaaaaagca ttggantgtc agtatgttga atcagtagtt 60 
tcactttaac tgtaaacaat ttcttaggac accatttggg ctagtttctg tgtaagtgta 120 
aatactacaa aaacttattt atactgttct tatgtcattt gttatattca tagatttata 180 
tgatgatatg acatctggct aaaaagaaat tattgcaaaa ctaaccacta tgtacttttt 240 
tataaatact gtatggacaa aaaatggcat tttttatatt aaattgttta gctctggcaa 300 
aaaaaaaaaa ttttaagagc tggtactaat aaaggattat tatgactgtt aaaaaaa 357 

<210> 197 
<211> 565 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 27 

<223> n = A,T,C or G 
<400> 197 

tcagctgagt accatcagga tatttanccc 
aagcaacaat acttcctctt gacagctttg 
tggtcctaca ctttttagga tgcttggtga 
gttcctatat tttgggctat gtgggtagga 
agaaagtaag cccagggctt cagatctaag 
agttgtaatg ctaggcataa gcactctata 
gaatgtttct gaaacattaa acttgtattt 
aaatgtgtct catacatatg ctgtactagg 
atttgaatat atgaaagaat ttatacaaga 
atataatttg tacctattgt aaaaa 

<210> 198 
<211> 484 
<212> DNA 
<213> Homo sapiens 

. <400> 198 

tatgtaagta ttggtgtctg ctttaaaaaa 
acatttgaga acagtgttac tctgagcagt 
ctgttggatg tgtccattgt cgccagtttg 
tgggcgcagc agcaggtggc aggggtgtgg 
tctctggtgc tttctgagag ggtctctaaa 
agcacgtatt tctcccctct agtacctctg 
agggcagcag actcttgagt atactgcaga 
tccaggggct caactgacca agtaacacag 
aaac 

<210> 199 
<211> 429 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 77 r 88, 134, 151, 189, 227, 

<223> n = A, T, C or G 

<400> 199 

gcttatgttt tttgttttaa cttttgtttt 

tacagtacct ttctcanaca ttttgtanaa 

gaacattaaa aagngtgata gcgatattag 

ataaacaana cacaacgttt ttatacaaca 

attgtttcct attaagtatt attctttggg 

caatttagca tttgctttng gtttttttct 

tatgtactgt atgggaaatg ttgtaaatat 
tgaatccaa 



tttaagtgct gttttgggag tagaaaacta 60 
attggaatgg ggttattaga tcattcacct 120 
acataacacc acttataatg aacatccctg 180 
attgttactt gttactgcag cagcagccct 240 
ttagtccaaa agctaaatga tttaaagtca 300 
atacattaaa ttataggccg agcaattagg 360 
atgtcactaa aattctaaca caaacttaaa 420 
cttcatcatg catttctaaa tttgtgtatg 480 
gtgttattta aaattattaa aaataaatgt 540 

565 



ggagacccag acttcacctg tcctttttaa 60 
tgggccacct tcaccttatc cgacagctga 120 
gctgttgccc ggacaggaca ggacctccat 180 
cttgaggtgg gtggcagcgt ctggtcctcc 240 
gcagagtgtg gttggcctgg gggaaggcag 300 
catttgtgag tgttccctct ggctttctga 360 
ggacatgctt tatcagtagg tcctgagggc 420 
aagttggggt atgtggccta tttgggtcgg 480 
, 484 



274, 319 



ttaacattta gaatattaca ttttgtatta 60 

ttcatttcgg cagctcacta ggattttgct 120 

ngccaatcaa atggaaaaaa ggtagtctta 180 

tactttaaaa tattaanaaa actccttaat 240 

caanattttc tgatgctttt gattttctct 300 

ctatttagca ttctgttaag gcacaaaaac 360 

taccttttcc acattttaaa cagacaactt 420 

429 



<210> 200 
<211> 279 
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<212> DNA 

<213> Homo sapiens 

<400> 200 

gcttttttga ggaattacag ggaagctcct ggaattgtac atggatatct ttatccctag 60 

ggggaaatca aggagctggg cacccctaat tctttatgga agtgtttaaa actattttaa 120 

ttttattaca agtattacta gagtagtggt tctactctaa gatttcaaaa gtgcatttaa 180 

aatcatacat gttcccgcct gcaaatatat tgttattttg gtggagaaaa aaatagtata 240 
ttctacataa aaaattaaag atattaacta agaaaaaaa 279" 

<210> 201 
<211> 569 
<212> DNA 
<213> Homo sapiens 

<400> 201 

taggtcagta tttttagaaa ctcttaatag ctcatactct tgataccaaa agcagecctg 60 

attgttaaag cacacacctg cacaagaagc agtgatggtt gcatttacat ttcctgggtg 120 

cacaaaaaaa aattctcaaa aagcaaggac ttacgctttt tgcaaagcct ttgagaagtt 180 

actggatcat aggaagctta taacaagaat ggaagattct taaataactc actttctttg 240 

gtatccagta acagtagatg ttcaaaatat gtagctgatt aataccagca ttgtgaacgc 300 

tgtacaacct tgtggttatt actaagcaag ttactactag cttctgaaaa gtagcttcat 360 

aattaatgtt atttatacac tgccttccat gacttttact ttgccctaag ctaatctcca 420 

aaatctgaaa tgctactcca atatcagaaa aaaaggggga ggtggaatta tatttcctgt 480 

gattttaaga gtacagagaa tcatgcacat ctctgattag ttcatatatg tctagtgtgt 540 
aataaaagtc aaagatgaac tctcaaaaa 559 

<210> 202 
<211> 501 
<212> DNA 
<2ll3> Homo sapiens 

<400> 202 

attaataggc ttaataattg ttggcaagga tccttttgct ttctttggca tgcaagctcc 60 
tagcatctgg cagtggggcc aagaaaataa ggtttatgca tgtatgatgg ttttcttctt 120 
gagcaacatg attgagaacc agtgtatgtc aacaggtgca tttgagataa ctttaaatga 180 
tgtacctgtg tggtctaagc tggaatctgg tcaccttcca tccatgcaac aacttgttca 240 
aattcttgac aatgaaatga agctcaatgt gcatatggat tcaatcccac accatcgatc 300 
atagcaccac ctatcagcac tgaaaactct tttgcattaa gggatcattg caagagcagc 360 
gtgactgaca ttatgaaggc ctgtactgaa gacagcaagc tgttagtaca gaccagatgc 420 
tttcttggca ggctcgttgt acctcttgga aaacctcaat gcaagatagt gtttcagtgc 480 
tggcatattt tggaattctg c 50i 

<210> 203 
<211> 261 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 36, 96 , 
<223> n = A, T,C or G 



<400> 203 

gacaagctcc tggtcttgag atgtcttctc 
gataaaatga atgagttctg tcatgattca 
gttagctctt tgaatgttct tgaaatttta 
tatcattgta taaaagctgt tatgtgcaac 



gttaangaga tgggcctttt ggaggtaaag 60 
ctattntata acttgcatga cctttactgt 120 
gactttcttt gtaaacaaat gatatgtcct 180 
agtgtggaga ttccttgtct gatttaataa 240 
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aatacttaaa cactgaaaaa a 261 

; <210> 204 
<211> 421 
<212> DNA 
<213> Homo sapiens 



<400> 204 

agcatctttt ctacaacgtt aaaattgcag 
caacaataac aataaatcct aagtgtaaat 
gcctgttttt tccctttttt ctcctgggaa 
gcctctttcc tcttctcatg cttgagcttc 
gcttgtgtgc ttggactcgg ctccaggtgg 
aactcaaacc ttcaagccct aggtgtagcc 
actggcatta acaaaaaaag aagataaaat 
a 

<210> 205 
<211> 460 
<212> DNA 
<213> Homo sapiens 



aagtagctta tcattaaaaa acaacaacaa 60 
cagttattct accccctacc aaggatatca 120 
taattgtggg cttcttccca aatttctaca 180 
cctgtttgca cgcatgcgtg tgcaggactg 240 
aagcatgctt tcccttgtta ctgttggaga 300 
attttgtcaa gtcatcaact gtatttttgt 360 
attgtaccat taaactttaa taaaacttta 420 

421 



<400> 205 

tactctcaca atgaaggacc tggaatgaaa 
tttagtgcaa atccagagcc agcgtcggtt 
ggaaaagctc tcaggagacc tcacctagat 
tgtcagccaa gagcctttta tttgaaagct 
gaggaagatg ggaaagaaag gacagatttt 
cagactttag aaaactacag gactccaaat 
gaatgagacc aaaggaaaag cttaacatac 
agagaatctt atgtttttta aatggagtta 



aatctgtgtc taaacaagtc ctctttagat 60 
gcctcgagta attctttcat gggtaccttt 120 
gcctattcaa gctttggaca gccatcagat 180 
cattcttccc cagacttgga ctctgggtca 240 
caggaagaaa atcacatttg tacctttaaa 300 
tttcagtctt atgacttgga cacatagact 360 
tacctcaagg tgaactttta tttaaaagag 420 
tgaattttaa 460 



<210> 206 
<211> 481 
<212> DNA 
<213> Homo sapiens 



<400> 206 

tgtggtggaa ttcgggacgc ccccagaccc 
tgcggaagca gtgacctctg acccctggtg 
gtcccgcggg acttggtttt ctcaagctct 
cgcctgccct gggtggatac ttgaacccca 
cggccttccc atctgcctgc ccacccggag 
acctcccgcc ctcagtcctg cggtgtgcgt 
cctggcctcc gcgcccgccc gcccacgcga 
ggtgtgaccc cctggaggtg ccctcggccc 
t 



tgactttttc ctgcgtgggc cgtctcctcc 60 
accttcgctt tgagtgcctt ttgaacgctg 120 
gtctgtccaa agacgctccg gtcgaggtcc 180 
gacgcccctc tgtgctgctg tgtccggagg 240 
ctctttccgc cggcgcaggg tcccaagccc 300 
ctgggcacgt cctgcacaca caatgcaagt 360 
gccgtacccg ccgccaactc tgttatttat 420 
accggggcta tttattgttt aatttatttg 480 

481 



<210> 207 
<211> 605 
<212> DNA 
<213> Homo sapiens 

<400> 207 

accctttttg gattcagggc tcctcacaat 
tatagaagca tccctttgta tactgttttg 
ctcactggat tctcacggta ggatttctga 
ttttttgatc ctagggtgct ccttttgttt 



taaaatgagt gtaatgaaac aaggtgaaaa 60 
ctacttacag tgtacttggc attgctttat 120 
gatcttaatc taagctccaa agttgtctac 180 
tacagagcag ggtcacttga tttgctagct 240 
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ggtggcagaa ttggcaccat tacccaggtc tgactgacca ccagtcagag gcactttatt 300 

tgtatcatga aatgatttga aatcattgta aagcagcgaa gtctgataat gaatgccagc 360 

"tttccttgtg ctttgataac aaagactcca aatattctgg agaacctgga taaaagtttg 420 

aagggctaga ttgggatttg aagacaaaat tgtaggaaat cttacatttt tgcaataaca 480 

aacattaatg aaagcaaaac attataaaag taattttaat tcaccacata cttatcaatt 540 

tcttgatgct tccaaatgac atctaccaga tatggttttg tggacatctt tttctgttta 600 

cataa " 605 

<210> 208 
<211> 655 
<212> DNA 

<213> Homo sapiens 
<400> 208 

ggcgttgttc tggattcccg tcgtaactta aagggaaact ttcacaatgt ccggagccct 60 
tgatgtcctg caaatgaagg aggaggatgt ccttaagttc cttgcagcag gaacccactt 120 
aggtggcacc aatcttgact tccagatgga acagtacatc tataaaagga aaagtgatgg 180 
catctatatc ataaatctca agaggacctg ggagaagctt ctgctggcag ctcgtgcaat 240 
tgttgccatt gaaaaccctg ctgatgtcag tgttatatcc tccaggaata ctggccagag 300 
ggctgtgctg aagtttgctg ctgccactgg agccactcca attgctggcc gcttcactcc 360 
tggaaccttc actaaccaga tccaggcagc cttccgggag ccacggcttc ttgtggttac 420 
tgaccccagg gctgaccacc agcctctcac ggaggcatct tatgttaacc tacctaccat 480 
tgcgctgtgt aacacagatt ctcctctgcg ctatgtggac attgccatcc catgcaacaa 540 
caagggagct cactcagtgg gtttgatgtg gtggatgctg gctcgggaag ttctgcgcat 600 
gcgtggcacc atttcccgtg aacacccatg ggaggtcatg cctgatctgt acttc 655 

<210> 209 
<211> 621 
<212> DNA 

<213> Homo sapiens 
<400> 209 

catttagaac atggttatca tccaagacta ctctaccctg caacattgaa ctcccaagag 60 

caaatccaca ttcctcttga gttctgcagc ttctgtgtaa atagggcagc tgtcgtctat 120 

gccgtagaat cacatgatct gaggaccatt catggaagct gctaaatagc ctagtctggg 180 

gagtcttcca taaagttttg catggagcaa acaaacagga ttaaactagg tttggttcct 240 

tcagccctct aaaagcatag ggcttagcct gcaggcttcc ttgggctttc tctgtgtgtg 300 

tagttttgta aacactatag catctgttaa gatccagtgt ccatggaaac cttcccacat 360 

gccgtgactc tggactatat cagtttttgg aaagcagggt tcctctgcct gctaacaagc 420 

ccacgtggac cagtctgaat gtctttcctt tacacctatg tttttaaata gtcaaacttc 480 

aagaaacaat ctaaacaagt ttctgttgca tatgtgtttg tgaacttgta tttgtattta 540 

gtaggcttct atattgcatt taacttgttt ttgtaactcc tgattcttcc ttttcggata 600 

ctattgatga ataaagaaat t 621 

<210> 210 
<211> 533 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature , 
<222> 20, 21, 61 
<223> n = A, T, C or G 

<400> 210 

cgqcttgggg agccggcggn ngagtccggg acgtggagac ccggggtccc ggcagccggg 60 
nggcccgcgg gcccagggtg gggatgcacc gccgcggggt gggagctggc gccatcgcca 120 
agaagaaact tgcagaggcc aagtataagg agcgagggac ggtcttggct gaggaccagc 180 
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tagcccagat gtcaaagcag ttggacatgt tcaagaccaa cctggaggaa tttgccagca 240 
aacacaagca ggagatccgg aagaatcctg agttccgtgt gcagttccag gacatgtgtg 300 
^caaccattgg cgtggatccg ctggcctctg gaaaaggatt ttggtctgag atgctgggcg 360 
tgggggactt ctattacgaa ctaggtgtcc aaattatcga agtgtgcctg gcgctgaagc 420 
atcggaatgg aggtctgata actttggagg aactacatca acaggtgttg aagggaaggg 480 
gcaagttcgc ccaggatgtc agtcaagatg acctgatcag agccatcaag aaa 533 

<210> 211 
<211> 451 
<212> DNA 

<213> Homo sapiens 
<400> 211 

ttagcttgag* ccgagaacga ggcgagaaag ctggagaccg aggagaccgc ctagagcgga 60 
gtgaacgggg aggggaccgt ggggaccggc ttgatcgtgc gcggacacct gctaccaagc 120 
ggagcttcag caaggaagtg gaggagcgga gtagagaacg gccctcccag cctgaggggc 180 
tgcgcaaggc agctagcctc acggaggatc gggaccgtgg gcgggatgcc gtgaagcgag 240 
aagctgccct acccccagtg agccccctga aggcggctct ctctgaggag gagttagaga 300 
agaaatccaa ggctatcatt gaggaatatc tccatctcaa tgacatgaaa gaggcagtcc 360 
agtgcgtgca ggagctggcc tcaccctcct tgctcttcat ctttgtacgg catggtgtcg 420 
agtctacgct ggagcgcagt gccattgctc g 451 

<210> 212 

<211> 471 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 54 

<223> n - A,T,C or G 



<400> 212 

gtgattattc ttgatcaggg agaagatcat ttagatttgt tttgcattcc ttanaatgga 60 
gggcaacatt ccacagctgc cctggctgtg atgagtgtC9 ttgcaggggc cggagtagga 120 

gcactggggt gggggcggaa ttggggttac tcgatgtaag ggattccttg ttgttgtgtt 180 

gagatccagt gcagttgtga tttctgtgga tcccagcttg gttccaggaa ttttgtgtga 240 

ttggcttaaa tccagttttc aatcttcgac agctgggctg gaacgtgaac tcagtagctg 300 

aacctgtctg acccggtcac gttcttggat cctcagaact ctttgctctt gtcggggtgg 360 

gggtgggaac tcacgtgggg agcggtggct gagaaaatgt aaggattctg gaatacatat 420 

tccatgggac tttccttccc tctcctgctt cctcttttco tgctccctaa c 471 

<210> 213 
<211> 511 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 27, 63, 337, 442 

<223> n = A,T,C or G ? 
<400> 213 

ctaattagaa acttgctgta ctttttnttt tcttttaggg gtcaaggacc ctctttatag 60 
ctnccatttg cctacaataa attattgcag cagtttgcaa tactaaaata ttttttatag 120 
actttatatt tttccttttg ataaagggat gctgcatagt agagttggtg taattaaact 180 
atctcagccg tttccctgct ttcccttctg ctccatatgc ctcattgtcc ttccagggag 240 
ctcttttaat cttaaagttc tacatttcat gctcttagtc aaattctgtt acctttttaa 300 
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taactcttcc cactgcatat ttocatcttg aattggnggt tctaaattct gaaactgtag 360 
ttgagataca gctatttaat atttctggga gatgtgcatc cctcttcttt gtggttgccc 420 
aaggttgttt tgcgtaactg anactccttg atatgcttca gagaatttag gcaaacactg 480 
gccatggccg tgggagtact gggagtaaaa t 511 

<210> 214 
<211> 521 
<212> DNA 
<213> Homo sapiens 

<400> 214 

agcattgcca aataatccct aattttccac taaaaatata atgaaatgat gttaagcttt 60 
ttgaaaagtt taggttaaac ctactgttgt tagattaatg tatttgttgc ttccctttat 120 
ctggaatgtg gcattagctt ttttatttta accctcttta attcttattc aattccatga 180 
cttaaggttg gagagctaaa cactgggatt tttggataac agactgacag ttttgcataa 240 
ttataatcgg cattgtacat agaaaggata tggctacctt ttgttaaatc tgcactttct 300 
aaatatcaaa aaagggaaat gaagtataaa tcaatttttg tataatctgt ttgaaacatg 360 
agttttattt gcttaatatt agggctttgc cccttttctg taagtctctt gggatcctgt 420 
gtagaagctg ttctcattaa acaccaaaca gttaagtcca ttctctggta ctagctacaa 480 
attcggtttc atattctact taacaattta aataaactga a 521 

<210> 215 
<211> 381 
<212> DNA 

<213> Homo sapiens 
<220> . 

<221> misc_feature 

<222> 17, 20, 60, 61, 365 

<223> n = A, T,C or G 

<400> 215 

gagcggagag cggaccngtn agagccctga gcagccccac cgccgccgcc ggcctagttn 60 
ncatcacacc ccgggaggag ccgcagctgc cgcagccggc cccagtcacc atcaccgcaa 120 
ccatgagcag cgaggccgag acccagcagc cgcccgccgc cccccccgcc gcccccgccc 180 
tcagcgccgc cgacaccaag cccggcacta cgggcagcgg cgcagggagc ggtggcccgg 240 
gcggcctcac atcggcggcg cctgccggcg gggacaagaa ggtcatcgca acgaaggttt 300 
tgggaacagt aaaatggttc aatgtaagga acggatatgg tttcatcaac aggaatqaca 360 
ccaangaaga tgtatttgta c 3^ 

<210> 216 
<211> 425 
<212> DNA 

<213> Homo sapiens 
<400> 216 

ttactaacta ggtcattcaa ggaagtcaag ttaacttaaa catgtcacct aaatgcactt 60 
gatggtgttg aaatgtccac cttcttaaat ttttaagatg aacttagttc taaagaagat 120 
aacaggccaa tcctgaaggt actccctgtt tgctgcagaa tgtcagatat tttggatgtt 180 
gcataagagt cctatttgcc ccagttaatt caacttttgt ctgcctgttt tgtggactgg 240 
ctggctctgt tagaactctg tccaaaaagt gcatggaata taacttgtaa agcttcccac 300 
aattgacaat atatatgcat gtgtttaaac caaatccaga aagcttaaac aatagagctg 360 
cataatagta tttattaaag aatcacaact gtaaacatga gaataactta aggattctag 420 
ag 1 425 

<210> 217 
<211> 181 
<212> DNA 
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<213> Homo sapiens 



s <400> 217 

gagaaaccaa atgataggtt gtagagcctg atgactccaa acaaagccat cacccgcatt 60 

cttcctcctt cttctggtgc tacagctcca agggcccttc accttcatgt ctgaaatgga 120 

actttggctt tttcagtgga agaatatgtt gaaggtttca ttttgttcta gaaaaaaaaa 180 
a 181 

<210> 218 
' <211> 405 
<212> DNA 
<213> Homo sapiens 

<400> 218 

caggccttcc agttcactga caaacatggg 
agtgatacca tcaagcctga tgtccaaaag 
gcgctgggct gttttagtgc caggctgcgg 
tatttttttt ttccattagt aaaacacaag 
acaaggcagg cctttcctac agggggtgga 
ggcctgagtt ggcgttgtgg gcaggctact 
attaatcttt tgtagtttgt attaaacttg 



gaagtgtgcc cagctggctg gaaacctggc 60 
agcaaagaat atttctccaa gcagaagtga 120 
tgggcagcca tgagaacaaa acctcttctg 180 
acttcagatt cagccgaatt gtggtgtctt 240 
gagaccagcc tttcttcctt tggtaggaat 300 
ggtttgtatg atgtattagt agagcaaccc 360 
aactgagaaa aaaaa 405 



<210> 219 
<211> 216 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 207, 210 
<223> n - A,T,C or G 



<400> 219 

actccaagag ttagggcagc agagtggagc 
ttaatttacc atgtaaaatt gctgtaaatg 
tcaattgtaa acttcttgtt aagactgtta 
aaaaataaaa aggaaagaac cctcttnaan 

<210> 220 
<211> 380 
<212> DNA 
<213> Homo sapiens 



gatttagaaa gaacatttta aaacaatcag 60 

ataatgtgta cagattttct gttcaaatat 120 

cgtttctatt gcttttgtat gggatattgc 180 

aaaaaa 216 



<400> 220 

cttacaaatt gcccccatgt gtaggggaca 
tctgtacaaa gtctttgcct ttttccttct 
atttcatctt tgagggaaac tgattagatg 
gcaccccaag gactcagaag atgattttaa 
gcatgtaata atgttgagtg gcagtcaaaa 
tgcattgaaa aggaaaacct gtctgagaaa 
gtaagtcttt gacaaaaaaa 



cagaaccctt tgagaaaact tagatttttg 60 
tcattttttt ccagtacatt aaatttgtca 120 
ggttgtgttt gtgttctgat ggagaaaaca 180 
cagttcagaa cagatgtgtg caatattggt 240 
gtcatgattt ttatcttagt tcttcattac 300 
atgcctgaca gtttaattta aaactatggt 360 

t 380 



<210> 221 
<211> 398 
<212> DNA 
<213> Homo sapiens 



<400> 221 
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ggttagtaag ctgtcgactt tgtaaaaaag 
tgtatattta atgaatgaac atgtacaatt 
^gtgagtctgc aagtgaattt cactgatgtt 
cccagccccg tttcctttta ttttggagct 
agtaaaatag aatcagcaaa tcactcttat 
gtttctgtgg gagcagtgta caccaactct 
ttgtatgttg aataaaattt tctataaaaa 

<210> 222 
<211> 301 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 49, 64 

<223> n = A,T,CorG 



ttaaaaatga aaaaaaaagg aaaaatgaat 60 
tgccactggg aggaggttcc tttttgttgg 120 
gatattcatt gtgtgtagtt ttatttcggt 180 
aatgccagct gcgtgtctag ttttgagtgc 240 
ttttcatcct tttccggtat tttttgggtt 300 
tcctgtatat tgcctttttg ctggaaaatg 360 
ttaaaaaa ^ 398 



<400> 222 

ttcgataatt gatctcatgg gctttccctg gaggaaaggt tttttttgnt gtttattttt 60 

taanaacttg aaacttgtaa actgagatgt ctgtagcttt tttgcccatc tgtagtgtat 120 

gtgaagattt caaaacctga gagcactttt tctttgttta gaattatgag aaaggcacta 180 

gatgacttta ggatttgcat ttttcccttt attgcctcat ttcttgtgac gccttgttgg 240 

ggagggaaat ctgtttattt tttcctacaa ataaaaagct aagattctat atcgcaaaaa 300 
a 301 

<210> 223 
<211> 200 
<212> DNA 
<213> Homo sapiens 

<400> 223 

gtaagtgctt aggaagaaac tttgcaaaca tttaatgagg atacactgtt catttttaaa 60 
attccttcac actgtaattt aatgtgtttt atattctttt gtagtaaaac aacataactc 120 
agatttctac aggagacagt ggttttattt ggattgtctt ctgtaatagg tttcaataaa 180 
gctggatgaa cttaaaaaaa 5 200 

<210> 224 
<211> 385 
<212> DNA 
<213> Homo sapiens 

<400> 224 

gaaaggtttg atccggactc aaagaaagca aaggagtgtg agccgccatc tgctggagca 60 
gctgtaactg caagacctgg acaagagatt cgtcagcgaa ctgcagctca aagaaacctt 120 
tctccaacac cagcaagccc taaccagggc cqtcctccac aagttccagt atctcctgga 180 
ccaccaaagg acagttctgc ccctggtgga cccccagaaa ggactgttac tccagcccta 240 
tcatcaaatg tgttaccaag acatcttgga tcccctgcta cttcagtgcc tggaatgggt 300 
aaacagagca cttaatgtta tttacagttt atattgtttt ctctggttac caataaaacg 360 
ggccattttc aggtggtaaa aaaaa 385 

<210> 225 ■ 

<211> 560 

<212> PRT 

<213> Homo sapiens 

<400> 225 

Met Glu Cys Leu Tyr Tyr Phe Leu Gly Phe Leu Leu Leu Ala Ala Arg 
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1 




5 


10 










15 




Leu Pro 


Leu Asp 


Ala Ala Lys 


Arg Phe His 


Asp 


Val 


Leu 


Gly 


Asn 


Glu 


i 


20 




25 








30 






Arg Pro 


Ser Ala 


Tyr Met Arg 


Glu His Asn 


Gin 


Leu 


Asn 


Gly 


Trp 


Ser 


35 




40 






45 








Ser Asp 


Glu Asn 


Asp Trp Asn 


Glu Lys Leu 


Tyr 


Pro 


Val 


Trp 


Lys 


Arg 


50 




55 






60 










Gly Asp 


Met Arg 


Trp Lys Asn 


Ser Trp Lys 


Gly 


Gly 


Arg 


Val 


Gin 


Ala 


65 




70 




75 










80 


Val Leu 


Thr Ser 


Asp Ser Pro 


Ala Leu Val 


Gly 


Ser 


Asn 


He 


Thr 


Phe 






85 


90 










95 




Ala Val 


Asn Leu 


He Phe Pro 


Arg Cys Gin 


Lys 


Glu 


Asp 


Ala 


Asn 


Gly 




100 




105 








110 






Asn lie 


Val Tyr 


Glu Lys Asn 


Cys Arg Asn 


Glu 


Ala 


Gly 


Leu 


Ser 


Ala 




115 




120 






125 








Asp Pro 


Tyr Val 


Tyr Asn Trp 


Thr Ala Trp 


Ser 


Glu 


Asp 


Ser 


Asp 


Gly 


130 


135 


- 




140 










Glu Asn 


Gly Thr 


Gly Gin Ser 


His His Asn 


Val 


Phe 


Pro 


Asp 


Gly 


Lys 


145 


150 




155 










160 


Pro Phe 


Pro His 


His Pro Gly 


Trp Arg Arg 


Trp 


Asn 


Phe 


He 


Tyr 


Val 






165 


170 










175 




Phe His 


Thr Leu 


Gly Gin Tyr 


Phe Gin Lys 


Leu 


Gly 


Arg 


Cys 


Ser 


Val 




180 




185 








190 






Arg Val 


Ser Val 


Asn Thr Ala 


Asn Val Thr 


Leu 


Gly 


Pro 


Gin 


Leu 


Met 


195 




200 






205 








Glu Val 


Thr Val 


Tyr Arg Arg 


His Gly Arg 


Ala 


Tyr 


Val 


Pro 


He 


Ala 


210 




215 






220 










Gin Val 


Lys Asp 


Val Tyr Val 


Val Thr Asp 


Gin 


He 


Pro 


Val 


Phe 


Val 


225 


230 




235 










240 


Thr Met 


Phe Gin 


Lys Asn Asp 


Arg Asn Ser 


Ser 


Asp 


Glu 


Thr 


Phe 


Leu 






245 


250 










255 




Lys Asp 


Leu Pro 


He Met Phe 


Asp Val Leu 


lie 


His 


Asp 


Pro 


Ser 


His 


260 




265 








270 






Phe Leu 


Asn Tyr 


Ser Thr He 


Asn Tyr Lys 


Trp 


Ser 


Phe 


Gly 


Asp 


Asn 




275 




280 






285 








Thr Gly 


Leu Phe 


Val Ser Thr 


Asn His Thr 


Val 


Asn 


His 


Thr 


Tyr 


Val 


290 




295 






300 










Leu Asn 


Gly Thr 


Phe Ser Leu 


Asn Leu Thr 


Val 


Lys 


Ala 


Ala 


Ala 


Pro 


305 


310 




315 










320 


Gly Pro 


Cys Pro 


Pro Pro Pro 


Pro Pro Pro 


Arg 


Pro 


Ser 


kys 


Pr6 


Thr 




325 


330 










335 




Pro Ser 


Leu Gly 


Pro Ala Gly 


Asp Asn Pro 


Leu 


Glu 


Leu 


Ser 


Arg 


He 




340 




345 








350 






Pro Asp 


Glu Asn 


Cys Gin He 


Asn Arg Tyr 


Gly 


His 


Phe 


Gin 


Ala 


Thr 




355 




360 






365 








lie Thr 


He Val 


Glu Gly He 


Leu Glu Val 


Asn 


He 


He 


Gin 


Met 


Thr 


370 




375 






380 










Asp Val 


Leu Met 


Pro Val Pro 


Trp Pro Glu 


Ser 


Ser 


Leu 


He 


Asp 


Phe 


385 




390 




395 










400 


Val Val 


Thr Cys 


Gin Gly Ser 


He Pro Thr 


Glu 


Val 


Cys 


Thr 


He 


He 






405 


410 










415 




Ser Asp 


Pro Thr 


Cys Glu He 


Thr Gin Asn 


Thr 


Val 


Cys 


Ser 1 


Pro 


Val 


420 




425 








430 






Asp Val 


Asp Glu 


Met Cys Leu 


Leu Thr Val 


Arg 


Arg 


Thr 


Phe 


Asn 


Gly 




435 




440 






445 








Ser Gly 


Thr Tyr 


Cys Val Asn 


Leu Thr Leu 


Gly 


Asp 


Asp 


Thr 


Ser 


Leu 


450 




455 






460 










Ala Leu 


Thr Ser 


Thr Leu He 


Ser Val Pro 


Asp 


Arg 


Asp 


Pro 


Ala 


Ser 
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465 470 475 480 

Pro Leu Arg Met Ala Asn Ser Ala Leu He Ser Val Gly Cys Leu Ala 
; 48 5 490 495 

He Phe Val Thr Val He Ser Leu Leu Val Tyr Lys Lys His Lys Glu 

500 505 510 

Tyr Asn Pro He Glu Asn Ser Pro Gly Asn Val Val Arg Ser Lys Gly 

515 520 525 

Leu Ser Val Phe Leu Asn Arg Ala Lys Ala Val Phe Phe Pro Gly Asn 

530 535 540 

Gin Glu Lys Asp Pro Leu Leu Lys Asn Gin Glu Phe Lys Gly Val Ser 
545 550 555 560 



<210> 226 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 226 

He Leu He Pro Ala Thr Trp Lys Ala 
1 5 



<210> 227 

<211> 9 

<212> PRT 

<213> Homo sapiens 

<400> 227 

Phe Leu Leu Asn Asp Asn Leu Thr Ala 
1 5 



<210> 228 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400>, 228 

Leu Leu Gly Asn Cys Leu Pro Thr Val 
1 5 



<210> 229 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 229 

Lys Leu Leu Gly Asn Cys Leu Pro Thr Val 
1 5 io 



<210> 230 

<211> 10 

<212> PRT 

<213> Homo sapiens 
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<400> 230 

Arg Leu Thr Gly Gly Leu Lys Phe Phe Val 
4 1 5 10 



<210> 231 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 231 

Ser Leu Gin Ala Leu Lys Val Thr Val 
1 5 



<210> 232 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 232 

Ala Gly Ala Asp Val lie Lys Asn Asp Gly He Tyr Ser Arg Tyr Phe 

15 10 15 

Phe Ser Phe Ala 
20 



<210> 233 
<211> 21 
<212> PRT 

<213> Homo sapiens 
<400> 233 

Phe Phe Ser Phe Ala Ala Asn Gly Arg Tyr Ser Leu Lys Val His Val 

1 5 10 15 

Asn His Ser Pro Ser 
20 



<210> 234 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 234 

Phe Leu Val Thr Trp Gin Ala Ser Gly Pro Pro Glu He He Leu Phe 

15 10 15 

Asp Pro Asp Gly 
20 



<210> 235 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 235 

Leu Gin Ser Ala Val Ser Asn He Ala Gin Ala Pro Leu Phe lie Pro 
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.Pro Asn Ser Asp 
20 



10 



15 



<210> 236 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 236 

He Gin Asp Asp Phe Asn Asn Ala He Leu Val Asn Thr Ser Lys Ara 

1 5 10 15 

Asn Pro Gin Gin 
20 



<210> 237 
<211> 21 
<212> PRT 

<213> Homo sapiens 
<400> 237 

Arg Asn Ser Leu Gin Ser Ala Val 

1 5 
Phe He Pro Pro Asn 
20 



Ser Asn He Ala Gin Ala Pro Leu 
10 15 



<210> 238 
<211> 20 
<212> PRT 

<213> Homo sapiens 

<400> 238 , 

Thr His Glu Ser His Arg He Tyr Val Ala He Arg Ala Met Asp Arg 

1 5 10 : 15 

Asn Ser Leu Gin 
20 



<210> 239 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 239 

Arg Asn Pro Gin Gin Ala Gly He Arg Glu He Phe Thr Phe Ser Pro 

15 10 15 

Gin He Ser Thr 
20 



<210> 240 
<211> 21 
<212> PRT 

<213> Homo sapiens 
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<400> 240 

Gly Gin Ala Thr Ser Tyr Glu lie Arg Met Ser Lys Ser Leu Gin Asn 
" 1 5 10 15 

lie Gin Asp Asp Phe 
20 



<210> 241 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 241 

Glu Arg Lys Trp Gly Phe Ser Arg Val Ser Ser Gly Gly Ser Phe Ser 

15 10 15 

Val Leu Gly Val 
20 



<210> 242 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 242 

Gly Ser His Ala Met Tyr Val Pro Gly Tyr Thr Ala Asn Gly Asn lie 

1 5 10 15 

Gin Met Asn Ala 
20 



<210> 243 
<211> 20 
<212> PRT 

<213> Homo sapiens , 
<400> 243 

Val Asn His Ser Pro Ser lie Ser Thr Pro Ala His Ser lie Pro Gly 

15 10 15 

Ser His Ala Met 
20 



<210> 244 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 244 

Ala Val Pro Pro Ala Thr Val Glu Ala Phe Val Glu Arg Asp Ser Leu 

15 10 ,15 

His Phe Pro His 
20 



<210> 245 
<211> 20 
<212> PRT 
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<213> Homo sapiens 
; <400> 245 

Lys Pro Gly His Trp Thr Tyr Thr Leu Asn Asn Thr His His Ser Leu 

15 10 15 

Gin Ala Leu Lys 
20 



<210> 246 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 246 

Asn Leu Thr Phe Arg Thr Ala Ser Leu Trp He. Pro Gly Thr Ala Lys 

15 10 15 

Pro Gly His Trp 
20 



<210> 247 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 247 

Leu His Phe Pro His Pro Val Met 

1 5 
Phe Tyr Pro He 
20 



He Tyr Ala Asn Val Lys Gin Gly 
10 15 



<210> 248 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 248 

Pro Glu Thr Gly Asp Pro Val Thr Leu Arg Leu Leu Asp Asp Gly Ala 

1 5 10 15 

Gly Ala Asp Val 
20 



<210> 249 
<211> 20 
<212> PRT 

<213> Homo sapiens 

<400> 249 , 

Gly Phe Tyr Pro He Leu Asn Ala Thr Val Thr Ala Thr Val Glu Pro 

1 5 10 .15 

Glu Thr Gly Asp 

20 



<210> 250 
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<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 250 

Phe Asp Pro Asp Gly Arg Lys Tyr Tyr Thr Asn Asn Phe lie Thr Asn 

15 10 15 

Leu Thr Phe Arg 
20 



<210> 251 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 251 

Leu Gin Ala Leu Lys Val Thr Val Thr Ser Arg Ala Ser Asn Ser Ala 

15 10 15 

Val Pro Pro Ala 
20 



<210> 252 
<211> 153 
<212> PRT 

<213> Homo sapiens 
<400> 252 

Met Ala Ser Val Arg Val Ala Ala Tyr Phe Glu Asn Phe Leu Ala Ala 

15 10 15 

Trp Arg Pro Val Lys Ala Ser Asp Gly Asp Tyr Tyr Thr Leu Ala Val 

20 25 30 

Pro Met Gly Asp Val Pro Met Asp Gly He Ser Val Ala Asp He Gly 

35 40 , 45 

Ala Ala Val Ser Ser He Phe Asn Ser Pro Glu Glu Phe Leu Gly Lys 

50 55 60 

Ala Val Gly Leu Ser Ala Glu Ala Leu Thr He Gin Gin Tyr Ala Asp 
65 70 75 80 

Val Leu Ser Lys Ala Leu Gly Lys Glu Val Arg Asp Ala Lys He Thr 

85 90 " 95 

Pro Glu Ala Phe Glu Lys Leu Gly Phe Pro Ala Ala Lys Glu He Ala 

100 105 110 

Asn Met Cys Arg Phe Tyr Glu Met Lys Pro Asp Arg Asp Val Asn Leu 

115 120 125 

Thr His Gin Leu Asn Pro Lys Val Lys Ser Phe Ser Gin Phe He Ser 

130 135 140 

Glu Asn Gin Gly Ala Phe Lys Gly Met 
145 150 



<210> 253 
<211> 462 
<212> DNA 
<213> Homo sapiens 



<400> 253 

atggccagtg tccgcgtggc ggcctacttt gaaaactttc tcgcggcgtg gcggcccgtg 60 
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aaagcctctg atggagatta ctacaccttg gctgtaccga tgggagatgt accaatggat 120 
.ggtatctctg ttgctgatat tggagcagcc gtctctagca tttttaattc tccagaggaa 180 
tttttaggca aggccgtggg gctcagtgca gaagcactaa caatacagca atatgctgat 240 
gttttgtcca aggctttggg gaaagaagtc cgagatgcaa agattacccc ggaagctttc 300 
gagaagctgg gattccctgc agcaaaggaa atagccaata tgtgtcgttt ctatgaaatg 360 
aagccagacc gagatgtcaa tctcacccac caactaaatc ccaaagtcaa aagcttcagc 420 
cagtttatct cagagaacca gggagccttc aagggcatgt ag 462 

<210> 254 

<211> 8031 

<212> DNA 

<213> Homo sapiens 

<400> 254 

tggcgaatgg gacgcgccct gtagcggcge attaagcgcg gcgggtgtgg tggttacgcg 60 
cagcgtgacc gctacacttg ccagcgccct agcgcccgct cctttcgctt tcttcccttc 120 
ctttctcgcc acgttcgccg gctttccccg tcaagctcta aatcgggggc . tccctttagg 180 
gttccgattt agtgctttac ggcacctcga ccccaaaaaa cttgattagg gtgatggttc 240 
acgtagtggg ccatcgccct gatagacggt ttttcgccct ttgacgttgg agtccacgtt 300 
ctttaatagt ggactcttgt tccaaactgg aacaacactc aaccctatct cggtctattc 360 
ttttgattta taagggattt tgccgatttc ggcctattgg ttaaaaaatg agctgattta 420 
acaaaaattt aacgcgaatt ttaacaaaat attaacgttt acaatttcag gtggcacttt 480 
tcggggaaat gtgcgcggaa cccctatttg tttatttttc taaatacatt caaatatgta 540 
tccgctcatg aattaattct tagaaaaact catcgagcat caaatgaaac tgcaatttat 600 
tcatatcagg attatcaata ccatattttt gaaaaagccg tttctgtaat gaaggagaaa 660 
actcaccgag gcagttccat aggatggcaa gatcctggta tcggtctgcg attccgactc 720 
gtccaacatc aatacaacct attaatttcc cctcgtcaaa aataaggtta tcaagtgaga 780 
aatcaccatg agtgacgact gaatccggtg agaatggcaa aagtttatgc atttctttcc 840 
agacttgttc aacaggccag ccattacgct cgtcatcaaa atcactcgca tcaaccaaac 900 
cgttattcat tcgtgattgc gcctgagcga gacgaaatac gcgatcgctg ttaaaaggac 960 
aattacaaac aggaatcgaa tgcaaccggc gcaggaacac tgccagcgca tcaacaatat 1020 
tttcacctga atcaggatat tcttctaata cctggaatgc tgttttcccg gggatcgcag 1080 
tggtgagtaa ccatgcatca tcaggagtac ggataaaatg cttgatggtc ggaagaggca 1140 
taaattccgt cagccagttt agtctgacca tctcatctgt aacatcattg gcaacgctac 1200 
ctttgccatg tttcagaaac aactctggcg catcgggctt cccatacaat cgatagattg 1260 
tcgcacctga ttgcccgaca ttatcgcgag cccatttata cccatataaa tcagcatcca 1320 
tgttggaatt taatcgcggc ctagagcaag acgtttcccg ttgaatatgg ctcataacac 1380 
cccttgtatt actgtttatg taagcagaca gttttattgt tcatgaccaa aatcccttaa 1440 
cgtgagtttt cgttccactg agcgtcagac cccgtagaaa agatcaaagg atcttcttga 1500 
gatccttttt ttctgcgcgt aatctgctgc ttgcaaacaa aaaaaccacc gctaccagcg 1560 
gtggtttgtt tgccggatca agagctacca actctttttc cgaaggtaac tggcttcagc 1620 
agagcgcaga taccaaatac tgtccttcta gtgtagccgt agttaggcca ccacttcaag 1680 
aactctgtag caccgcctac atacctcgct ctgctaatcc tgttaccagt ggctgctgcc 1740 
agtggcgata agtcgtgtct taccgggttg gactcaagac gatagttacc ggataaggcg 1800 
cagcggtcgg gctgaacggg gggttcgtgc acacagccca gcttggagcg aacgacctac 1860 
accgaactga gatacctaca gcgtgagcta tgagaaagcg ccacgcttcc cgaagggaga 1920 
aaggcggaca ggtatccggt aagcggcagg gtcggaacag gagagcgcac gagggagctt 1980 
ccagggggaa acgcctggta tctttatagt cctgtcgggt ttcgccacct ctgacttgag 2040 
cgtcgatttt tgtgatgctc gtcagggggg cggagcctat ggaaaaacgc cagcaacgcg 2100 
gcctttttac ggttcctggc cttttgctgg ccttttgctc acatgttctt tcctgcgtta 2160 
tcccctgatt ctgtggataa ccgtattacc gcctttgagt gagctgatac cgctcgccgc 2220 
agccgaacga ccgagcgcag cgagtcagtg agcgaggaag cggaagagcg cctgatgcgg 2280 
tattttctcc ttacgcatct gtgcggtatt tcacaccgca tatatggtgc actctcagta 2340 
caatctgctc tgatgccgca tagttaagcc agtatacact ccgctatcgc tacgtgactg 2400 
ggtcatggct gcgccccgac acccgccaac acccgctgac gcgccctgac gggcttgtct 2460 
gctcccggca tccgcttaca gacaagctgt gaccgtctcc gggagctgca tgtgtcagag 2520 
gttttcaccg tcatcaccga aacgcgcgag gcagctgcgg taaagctcat cagcgtggtc 2580 
gtgaagcgat tcacagatgt ctgcctgttc atccgcgtcc agctcgttga gtttctccag 2640 



WO 02/00174 



PCT/US01/21065 



114 



aagcgttaat gtctggcttc tgataaagcg 
ggtcactgat gcctccgtgt aagggggatt 
acgagagagg atgctcacga tacgggttac 
ttgtgagggt aaacaactgg cggtatggat 
tcaatgccag cgcttcgtta atacagatgt 
tgcgatgcag atccggaaca taatggtgca 
cgaaacacgg aaaccgaaga ccattcatgt 
gcagtcgctt cacgttcgct cgcgtatcgg 
ccgccagcct agccgggtcc tcaacgacag 
catgccggcg ataatggcct gcttctcgcc 
ggcttgagcg agggcgtgca agattccgaa 
gctccagcga aagcggtcct cgccgaaaat 
gagttgcatg ataaagaaga cagtcataag 
ccggaaggag ctgactgggt tgaaggctct 
atgagtgagc taacttacat taattgcgtt 
cctgtcgtgc cagctgcatt aatgaatcgg 
tgggcgccag ggtggttttt cttttcacca 
ccgcctggcc ctgagagagt tgcagcaagc 
aatcctgttt gatggtggtt aacggcggga 
atcccactac cgagatatcc gcaccaacgc 
cgcccagcgc catctgatcg ttggcaacca 
gcatttgcat ggtttgttga aaaccggaca 
tcggctgaat ttgattgcga gtgagatatt 
agacagaact taatgggccc gctaacagcg 
gctccacgcc cagtcgcgta ccgtcttcat 
ggtcagagac atcaagaaat aacgccggaa 
catcctggtc atccagcgga tagttaatga 
tgtgcaccgc cgctttacag gcttcgacgc 
tggcacccag ttgatcggcg cgagatttaa 
gggccagact ggaggtggca acgccaatca 
ccacgcggtt gggaatgtaa ttcagctccg 
tcgcagaaac gtggctggcc tggttcacca 
catactctgc gacatcgtat aacgttactg 
cttccgggcg ctatcatgcc ataccgcgaa 
tctcgacgct ctcccttatg cgactcctgc 
ccgttgagca ccgccgccgc aaggaatggt 
ccggccacgg ggcctgccac catacccacg 
cgagcccgat cttccccatc ggtgatgtcg 
gcgccggtga tgccggccac gatgcgtccg 
aattaatacg actcactata ggggaattgt 
ttttgtttaa ctttaagaag gagatataca 
acagcttcaa gacaatgggt ataatggatt 
gaatcagaac ctcatctcaa acattaagga 
taatgctacc aagagaagag tatttttcag 
gaaagctaat aataacagca aaataaaaca 
gactgactgg tatggggcac atggagatga 
aaaagaggga aaatacattc atttcacacc 
tggctacgga tcacgaggcc gagtgtttgt 
gttcgatgag tataacaatg acaaaccttt 
gacaaggtgt tcatctgaca tcacaggcat 
agaaaactgt attattagta agctttttaa 
ccaaaatgca actgcatcaa taatgttcat 
taatgcaagt acccacaacc aagaagcacc 
aagtgcatgg gatgtaatca cagactctgc 
gactgagctt ccacctcctc ccacattctc 
tttagtgctg gatgtgtcca gcaagatggc 
agccgcagaa ttttatttga tgcagattgt 
tttcgacagc aaaggagaga tcagagccca 



ggccatgtta agggcggttt tttcctgttt 2700 
tctgttcatg ggggtaatga taccgatgaa 2760 
tgatgatgaa catgcccggt tactggaacg 2820 
gcggcgggac cagagaaaaa tcactcaggg 2880 
aggtgttcca cagggtagcc agcagcatcc 2940 
gggcgctgac ttccgcgttt ccagacttta 3000 
tgttgctcag gtcgcagacg ttttgcagca 3060 
tgattcattc tgctaaccag taaggcaacc 3120 
gagcacgatc atgcgcaccc gtggggccgc 3180 
gaaacgtttg gtggcgggac cagtgacgaa 3240 
taccgcaagc gacaggccga tcatcgtcgc 3300 
gacccagagc gctgccggca cctgtcctac 3360 
tgcggcgacg atagtcatgc cccgcgccca 3420 
caagggcatc ggtcgagatc ccggtgccta 3480 
gcgctcactg cccgctttcc agtcgggaaa 3540 
ccaacgcgcg gggagaggcg gtttgcgtat 3600 
gtgagacggg caacagctga ttgcccttca 3660 
ggtccacgct ggtttgcccc agcaggcgaa 3720 
tataacatga gctgtcttcg gtatcgtcgt 3780 
gcagcccgga ctcggtaatg gcgcgcattg 3840 
gcatcgcagt gggaacgatg ccctcattca 3900 
tggcactcca gtcgccttcc cgttccgcta 3960 
tatgccagcc agccagacgc agacgcgccg 4020 
cgatttgctg gtgacccaat gcgaccagat 4080 
gggagaaaat aatactgttg atgggtgtct 4140 
cattagtgca ggcagcttcc acagcaatgg 4200 
tcagcccact gacgcgttgc gcgagaagat 4260 
cgcttcgttc taccatcgac accaccacgc 4320 
tcgccgcgac aatttgcgac ggcgcgtgca 4380 
gcaacgactg tttgcccgcc agttgttgtg 4440 
ccatcgccgc ttccactttt tcccgcgttt 4500 
cgcgggaaac ggtctgataa gagacaccgg 4560 
gtttcacatt caccaccctg aattgactct 4620 
aggttttgcg ccattcgatg gtgtccggga 4680 
attaggaag9 agcccagtag taggttgagg 4740 
gcatgcaagg agatggcgcc caacagtccc 4800 
ccgaaacaag cgctcatgag cccgaagtgg 4860 
gcgatatagg cgccagcaac cgcacctgtg 4920 
gcgtagagga tcgagatctc gatcccgcga 4 980 
gagcggataa caattcccct ctagaaataa 5040 
tatgcagcat caccaccatc accacggagt 5100 
gctcattgca attaatcctc aggtacctga 5160 
aatgataact gaagcttcat tttacctatt 5220 
aaatataaag attttaatac ctgccacatg 5280 
agaatcatat gaaaaggcaa atgtcatagt 5340 
tccatacacc ctacaataca gagggtgtgg 5400 
taatttccta ctgaatgata acttaacagc 5460 
ccatgaatgg gcccacctcc gttggggtgt 5520 
ctacataaat gggcaaaatc aaattaaagt 5580 
ttttgtgtgt gaaaaaggtc cttgccccca 5640 
agaaggatgc acctttatct ^caatagcac 5700 
gcaaagttta tcttctgtgg ttgaattttg 5760 
aaacctacag aaccagatgt gcagcctcag 5820 
tgactttcac cacagctttc ccatgaacgg 5880 
gcttgtagag gctggtgaca aagtggtctg 5940 
agaggctgac agactccttc aactacaaca 6000 
tgaaattcat accttcgtgg gcattgccag 6060 
gctacaccaa attaacagca atgatgatcg 6120 
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aaagttgctg gtttcatatc tgcccaccac 
.ttcagggctt aagaaaggat ttgaggtggt 
tgtgatgata ttagtgacca gcggagatga 
gctcagcagt ggttcaacaa ttcactccat 
ggaggaatta tcacgtctta caggaggttt 
caatagcatg attgatgctt tcagtagaat 
acatattcag cttgaaagta caggtgaaaa 
agtgactgtg gataatactg tgggcaacga 
tggtcctcct gagattatat tatttgatcc 
tatcaccaat ctaacttttc ggacagctag 
gcactggact tacaccctga acaataccca 
gacctctcgc gcctccaact cagctgtgcc 
agacagcctc cattttcctc atcctgtgat 
tcccattctt aatgccactg tcactgccac 
gctgagactc cttgatgatg gagcaggtgc 
gaggtatttt ttctcctttg ctgcaaatgg 
ctctcccagc ataagcaccc cagcccactc 
aggttacaca gcaaacggta atattcagat 
tgaggaggag cgaaagtggg gctttagccg 
gggagttcca gctggccccc accctgatgt 
agctgtaaaa gtagaagagg aattgaccct 
tcagggccag gctacaagct atgaaataag 
tgactttaac aatgctattt tagtaaatac 
cagggagata tttacgttct caccccaaat 
tggagaaaca catgaaagcc acagaattta 
cttacagtct gctgtatcta acattgccca 
tcctgtacct gccagagatt atcttatatt 
aggaatcatt tgccttatta tagttgtgac 
agacaagaaa gagaatggaa caaaattatt 
ggcggccgct cgagcaccac caccaccacc 
aggaagctga gttggctgct gccaccgctg 
ctaaacgggt cttgaggggt tttttgctga 



tgtatcagct aaaacagaca tcagcatttg 6180 
tgaaaaactg aatggaaaag cttatggctc 6240 
taagcttctt ggcaattgct tacccactgt 6300 
tgccctgggt tcatctgcag ccccaaatct 6360 
aaagttcttt gttccagata tatcaaactc 6420 
ttcctctgga actggagaca ttttccagca 6480 
tgtcaaacct caccatcaat tgaaaaacac 6540 
cactatgttt ctagttacgt ggcaggccag 6600 
tgatggacga aaatactaca caaataattt 6660 
tctttggatt ccaggaacag ctaagcctgg 6720 
tcattctctg caagccctga aagtgacagt 6780 
cccagccact gtggaagcct ttgtggaaag 6840 
gatttatgcc aatgtgaaac agggatttta 6900 
agttgagcca gagactggag atcctgttac 6960 
tgatgttata aaaaatgatg gaatttactc 7020 
tagatatagc ttgaaagtgc atgtcaatca 7080 
tattccaggg agtcatgcta tgtatgtacc 7140 
gaatgctcca aggaaatcag taggcagaaa 7200 
agtcagctca ggaggctcct tttcagtgct 7260 
gtttccacca tgcaaaatta ttgacctgga 7320 
atcttggaca gcacctggag aagactttga 7380 
aatgagtaaa agtctacaga atatccaaga 7440 
atcaaagcga aatcctcagc aagctggcat 7500 
ttccacgaat ggacctgaac atcagccaaa 7560 
tgttgcaata cgagcaatgg ataggaactc 7620 
ggcgcctctg tttattcccc ccaattctga 7680 
gaaaggagtt ttaacagcaa tgggtttgat 7740 
acatcatact ttaagcagga aaaagagagc 7800 
ataatgaatt ctgcagatat ccatcacact 7860 
actgagatcc ggctgctaac aaagcccgaa 7920 
agcaataact agcataaccc cttggggcct 7980 
aaggaggaac tatatccgga t 8031 



<210> 255 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 

<222> 9, 67 r 247 f 275/ 277, 397 

<223> n = A,T,C or G 

<400> 255 

gtggccagng actagaaggc gaggcgccgc 
agtccanagg acggagaaga cgaggaagag 
ggaattattg attcagactt cctctcaaaa 
gacactgaga ggcccattct gcaagtggac 
actctangga cctgtgttat atttgaagaa 
aataaaacag tgctaaaata taaatgccat 
ctcctgacag agaagaagga aggagaagaa 

<210> 256 
<211> 401 
<212> DNA 
<213> Homo sapiens 



gggaccatgg cggcggcggc ggacgagcgg 60 
gaggagcagt tggttctggt ggaattatca 120 
tgtgaaaata aatgcaaggt tttgggcatt 180 
agctgtgtct ttgctgggga gtatgaagac 240 
aatgntnaac atgctgatac afgaaggcaat 300 
acaatgaaga agctcagcat gacaagaact 360 
aacatangtg g 401 
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<220> 

X221> misc_feature 

<222> 7, 37, 51, 19, 96, 98, 103, 104, 107, 116, 167, 181, 183, 
194, 206, 276, 303, 307, 308, 310, 323, 332, 341, 353, 374, 
376 

<223> n » A,T,C or G 
<400> 256 

tggtggncct gggatgggga accgcggtgg cttccgngga ggtttcggca ntggcatccg 60 
gggccggggt cgcggccgng gacggggccg gggccnangc cgnnganctc gcggangcaa 120 
ggccgaggat aaggagtgga tgcccgtcac caacttgggc cgcttgncca aggacatgaa 180 
nancaagccc ctgnaggaga tctatntctt cttccctgcc ccattaagga atcaagagat 240 
catttgattt cttcctgggg gcctctctca aggatnaggt ttttgaagat tatgccagtg 300 
canaaannan accccgttgc ccngtccatc tncacccaac ncttccaagg gcnatttttg 360 
tttaggcctc attncngggg ggaaccttaa cccaatttgg g 401 

<210> 257 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 382, 387 
<223> n = A,T,C or G 

<400> 257 

atgtatgtaa aacacttcat aaaatgtaaa gggctataac aaatatgtta taaagtgatt 60 
ctctcagccc tgaggtatac agaatcattt gcctcagact gctgttggat tttaaaattt 120 
ttaaaatatc tgctaagtaa tttgctatgt cttctcccac actatcaata tgcctgcttc 180 
taacaggctc cccactttct tttaatgtgc tgttatgagc tttggacatg agataaccgt 240 
gcctgttcag agtgtctaca gtaagagctg gacaaactct ggagggacac agtctttgag 300 
acagctcttt tggttgcttt ccacttttct gaaaggttca cagtaacctt ctagataata 360 
gaaactccca gttaaagcct angctancaa ttttttttajg t 401 

<210> 258 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<400> '258 

ggagcgctag gtcggtgtac gaccgagatt agggtgcgtg ccagctccgg gaggccgcgg 60 
tgaggggccg ggcccaagct gccgacccga gccgatcgtc agggtcgcca gcgcctcagc 120 
tctgtggagg agcagcagta gtcggagggt gcaggatatt agaaatggct actccccagt 180 
caattttcat ctttgcaatc tgcattttaa tgataacaga attaattctg gcctcaaaaa 240 
gctactatga tatcttaggt gtgccaaaat cggcatcaga gcgccaaatc aagaaggcct 300 
ttcacaagtt ggccatgaag taccaccctg acaaaaataa gacccagatg ctgaagcaaa 360 
attcagagag attgcagaag catatgaaac actctcagat g 401 

<210> 259 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<400> 259 

attgggtttg gagggaggat gatgacagag gaatgccctt tggccatcac ggttttgatt 60 
ctccagaata ttgtgggttt gatcatcaat gcagtcatgt taggctgcat tttcatgaaa 120 
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acagctcagg ctcacagaag ggcagaaact ttgattttca gccgccatgc tgtgattgcc 180 

..gtccgaaatg gcaagctgtg cttcatgttc cgagtgggtg acctgaggaa aagcatgatc 240 

attagtgcct ctgtgcgcat ccaggtggtc aagaaaacaa ctacacctga aggggaggtg 300 

gttcctattc accaactgga cattcctgtt gataacccaa tcgagagcaa taacattttt 360 

ctggtggccc ctttgatcat ctgccacgtg attgacaagc g 401 

<210> 260 
<211> 363 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> miscjfeature 

<222> 7, 9, 19, 41, 63, 73, 106, 111, 113, 116, 119, 156, 158, 
162, 187, 247, 288, 289, 290, 292, 298, 299, 300, 340 
<223> n m A,T,C or G 

<400> 260 

aggaganang gagggggana tgaataggga tggagaggga natagtggat gagcagggca 60 

canggagagg aancagaaag gagaggcaag acagggagac acacancaca nangangana 120 

caggtggggg ctggggtggg gcatggagag cctttnangt cncccaggcc accctgctct 180 

cgctggnctg ttgaaaccca ctccatggct tcctgccact gcagttgggc ccagggctgg 240 

cttattnctg gaatgcaagt ggctgtggct tggagcctcc cctctggnnn anggaaannn 300 

attgctccct tatctgcttg gaatatctga gtttttccan cccggaaata aaacacacac 360 



<210> 261 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc__feature 
<222> 114, 152 

<223> n - A,T,C or G i 
<400> 261 

cggctctccg ccgctctccc ggggtttcgg ggcacttggg tcccacagtc tggtcctgct 60 
tcaccttccc ctgacctgag tagtcgccat ggcacaggtt ctcagaggca ctgngactga 120 
cttccctgga tttgatgagc gggctgatgc anaaactctt cggaaggcta tgaaaggctt 180 
gggcacagat gaggagagca tcctgactct gttgacatcc cgaagtaatg ctcagcgcca 240 
ggaaatctct gcagctttta agactctgtt tggcagggat cttctggatg acctgaaatc 300 
agaactaact ggaaaatttg aaaaattaat tgtggctctg atgaaaccct ctcggcttta 360 
tgatgcttat gaactgaaac atgccttgaa gggagctgga a 401 

<210> 262 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> , 
<221> misc_feature 

<222> 7, 26, 258, 305, 358, 373, 374, 378 
<223> n = A,T,C or G 

<400> 262 

agtctanaac atttctaata ttttgngctt tcatatatca aaggagatta tgtgaaacta 60 
tttttaaata ctgtaaagtg acatatagtt ataagatata tttctgtaca gtagagaaag 120 
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agtttataac atgaagaata ttgtaccatt 
ttcaaaagaa taatgataga ggtgaaaata 
; tcaactcaaa aattatgntg catagtttta 
tccancttca atgagaaaat aaaatctaca 
tttttttgct aannagcnaa aaatataaac 

<210> 263 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 232, 290, 304, 326, 383 

<223> n » A,T,C or G 



atacattttc attctcgatc tcataagaaa 180 
tgtttacttt ctctaaatca agcctagttg 240 
ttttgaattt aggttttggg actacttttt 300 
actcaggagt tactacagaa gttctaanta 360 
atatgaaaat g 401 



<400> 263 

ctgtccgacc aagagaggcc ggccgagccc gaggcttggg cttttgcttt ctggcggagg 60 
gatctgcggc ggtttaggag gcggcgctga tcctgggagg aagaggcagc tacggcggcg 120 
gcggcggtgg cggctagggc ggcggcgaat aaaggggccg ccgccgggtg atgcggtgac 180 
cactgcggca ggcccaggag ctgagtgggc cccggccctc agcccgtccc gncggacccg 240 
ctttcctcaa ctctccatct tctcctgccg accgagatcg ccgaggcggn ctcaggctcc 300 
ctancccctt ccccgtccct tccccncccc cgtccccgcc ccgggggccg ccgccacccg 360 
cctcccacca tggctctgaa ganaatccac aaggaattga a 401 

<210> 264 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<400> 264 

aacaccagcc actccaggac ccctgaaggc ctctaccagg tcaccagtgt tctgcgccta 60 

aagccacccc ctggcagaaa cttcagctgt gtgttctgga atactcacgt gagggaactt 120 

actttggcca gcattgacct tcaaagtcag atggaaccca ggacccatcc aacttggctg 180 

cttcacattt tcatcccctc ctgcatcatt gctttcattt tcatagccac agtgatagcc 240 
ctaagaaaac aactctgtca aaagctgtat tcttcaaaag acacaacaaa aagacctgtc 300 

accacaacaa agagggaagt gaacagtgct gtgaatctga acctgtggtc ttgggagcca 360 

gggtgacctg atatgacatc taaagaagct tctggactct g 401 

<210> 265 
<211> 271 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> 59 

<223> n = A,T,C or G 
<400> 265 

gccacttcct gtggacatgg gcagagcgct gctgccagtt cctggtagcc ttgaccacna 60 
cgctgggggg tctttgtgat ggtcatgggt ctcatttgca cttgggggtg tgggattcaa 120 
gttagaagtt tctagatctg gccgggcgca gtggctcaca cctgtaatcc cagcacttta 180 
ggaggctgag gcaggcggat catgaggtca ggagatcgag accgtcctgg ctaacacagt 240 
gaaaccccgt ctctactaaa aatacaaaaa a ^ . 

<210> 266 
<211> 401 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 45 

<223> n = A,T,CorG 



<400> 266 

attcataaat ttagctgaaa gatactgatt caatttgtat acagngaata taaatgagac 60 
gacagcaaaa ttttcatgaa atgtaaaata tttttatagt ttgttcatac tatatgaggt 120 
tctattttaa atgactttct ggattttaaa aaatttcttt aaatacaatc atttttgtaa 180 
tatttatttt atgcttatga tctagataat tgcagaatat cattttatct gactctgtct 240 
tcataagaga gctgtggccg aattttgaac atctgttata gggagtgatc aaattagaag 300 
gcaatgtgga aaaacaattc tgggaaagat ttctttatat gaagtccctg ccactagcca 360 
gccatcctaa ttgatgaaag ttatctgttc acaggcctgc a ~* 401 

i 

<210> 267 
<211> 401 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

tll^S^'slV' ™ ' ^ ^ * 23 ' 336 ' ^ 355 ' 365 ' 

<223> n — A, T, C or G 

<400> 267 

gaagaggcat cacctgatcc cggagacctt tggagttaag aggcggcgga agcgagggcc 60 
tgtggagtcg gatcctcttc ggggtgagcc agggtcggcg cgcgcggctg tctcanaact 120 
catgcagctg ttcccgcgag gcctgtttga ggacgcgctg ccgcccatcg tgctgaggag 180 
ccaggtgtac agccttgtgc ctgacaggac cgtggccgac cggcagctga aggagcttca 240 
agagcanggg gagacaaaat cgtccagctg ggcttcnact tggatgccca tggaanttat 300 
tctttcnctt ganggactta cnngggaccc aagaanccc^ tncaaggggc ccttngtgga 360 
tgggncccga aaccccnnta tttgcccttg ggggggncca a 40 i 

<210> 268 
<211> 223 
<212> DNA 

<213> Homo sapiens 
<400> 268 

tcgccatgtt ggccaggctg gtcttgaact cctgacttta agtgatccac ccgcctcaac 60 
ctcccaaagt gctgggatta caggtgtgag ccaccgcgcc tggcctgata catactttta 120 
gaatcaagta gtcacgcact ttttctgttc atttttctaa aaagtaaata tacaaatott 180 
ttgttttttg ttttttttgt ttgtttgttt ctgttttttt ttt 2 23 

<210> 269 
<211> 401 
<212> DNA 

<213> Homo sapiens ? 
<400> 269 

actatgtaaa ccacattgta ctttttttta ctttggcaac aaatatttat acatacaaga 60 
tgctagttca tttgaatatt tctcccaact tatccaagga tctccagctc taacaaaatg 120 
gtttattttt atttaaatgt caatagttgt tttttaaaat ccaaatcaga ggtgcaggcc 180 
accagttaaa tgccgtctat caggttttgt gccttaagag actacagagt caaagctcat 240 
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ttttaaagga gtaggacaaa gttgtcacag gtttttgttg ttgtttttat tgcccccaaa 300 
attacatgtt aatttccatt tatatcaggg attctattta cttgaagact gtgaagttgc 360 
battttgtct cattgttttc tttgacataa ctaggatcca t 401 

<210> 270 
<211> 401 
<212> DNA 
<213> Homo sapiens 

t <220> 

<221> mis cofeature 
<222> 240, 382 
<223> n = A,T,C or G 

<400> 270 

tggctgttga ttcacctcag cactgcttgg tatctgcacc ctacctctct ttagaggctg 60 

: ccttgtcaac tgaaaaatgc acctgacttc gagcaagact ctttccttag gttctggatc 120 

tgtttgagcc ccatggcact gagctggaat ctgagggtct tgttccaagg atgtgatgat 180 

gtgggagaat gttctttgaa agagcagaaa tccagtctgc atggaaacag cctgtagagn 240 

agaagtttcc agtgataagt gttcactgtt ctaaggaggt acaccacagc tacctgaatt 300 

ttcccaaaat gagtgcttct gtgcgttaca actggccttt gtacttgact gtgatgactt 360 
tgttttttct tttcaattct anatgaacat gggaaaaaat g 401 

<210> 271 
<211> 329 
<212> DNA 
<213> Homo sapiens 

<400> 271 

ccacagcctc caagtcaggt ggggtggagt cccagagctg cacagggttt ggcccaagtt 60 
tctaagggag gcacttcctc ccctcgccca tcagtgccag cccctgctgg ctggtgcctg 120 
agcccctcag acagccccct gccccgcagg cctgccttct cagggacttc tgcggggcct 180 
gaggcaagcc atggagtgag acccaggagc cggacacttc tcaggaaatg gcttttccca 240 
acccccagcc cccacccggt ggttcttcct gttctgtgac tgtgtatagt gccaccacag 300 
cttatggcat ctcattgagg acaaaaaaa t 329 

<210> 272 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 1, 7, 12, 21, 61, 62, 66, 72, 78, 88, 90, 92, 98, 117, 119, 

128, 130, 134, 142, 144, 151, 159, 162, 164, 168, 169, 177, 

184, 185, 188, 194, 202, 204, 209, 213, 218, 223, 231, 260, 

272, 299, 300, 306, 321, 322, 323, 331, 335, 336, 338 
<223> n - A,T,C or G 

<221> misc_feature 

<222> 341, 342, 343, 345, 346, 351, 358, 360, 362, 363, ,387, 390, 
392 

<223> n = A,T,C or G 
<400> 272 

nggctgntaa cntcggaggt nacttcctgg actatcctgg agaccccctc cgcttccacg 60 
nncatnatat cnctcatngc tgggcccntn angacacnat cccactccaa cacctgngng 120 
atgctggncn cctnggaacc ancntcagaa ngaccctgnt cntntgtnnt ccgcaanctg 180 
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aagnnaangc gggntacacc tncntgcant 
acgggatgtg gctgcgccan gagccaagag 
^agggantcta caacattgct nnntaccttt 
tnntaacact acatcttttt tactgcnccn 

<210> 273 
<211> 401 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<222> 399 

<223> n « A, T,C or G 



ggnccacnct gcngggaact ntacacacct 240 
cntttctgga tgattcccca gcctcttgnn 300 
ntccnncngc nnntnntgga ntacaggngn 360 
tncttggtgg g 40 i 



<400> 273 

cagcaccatg aagatcaaga tcatcgcacc 
tggctccatc ctggcctcac tgtccacctt 
cgacgagtcg ggcccctcca tcgtccaccg 
gtagcatttg ctgcatgggt taattgagaa 
ctcatgctag cctcacgaaa ctggaataag 
tatctgatat cagcactgga ttgtagaact 
aactgttccc cttggtatta acgtgtcagg 

<210> 274 
<211> 401 
<212> DNA 

<213> Homo sapiens 



cccagagcgc aagtactcgg tgtggatcgg 60 
ccagcagatg tggattagca agcaggagta 120 
caaatgcttc taaacggact cagcagatgc 180 
tagaaatttg cccctggcaa atgcacacac 240 
ccttcgaaaa gaaattgtcc ttgaagcttg 300 
tgttgctgat tttgaccttg tattgaagtt 360 
gctgagtgnt c 401 



<400> 274 

ccacccacac ccaccgcgcc ctcgttcgcc 
cgccgcccag gccatcgcca ccctccgcag 
cctaccgcag gatgttcggc ggcccgggca 
acgtgactac gtccacccgc acctacagcc 
gcagcctcta cgcctcgtcc ccgggcggcg 
tgcggagcag cgtgcccggg gtgcggctcc 
acgccatcaa caccgagttc aagaacaccc 

<210> 275 
<211> 401 
<212> DNA 

<213> Homo sapiens 



tcttctccgg gagccagtcc gcgccaccgc 60 
ccatgtccac caggtccgtg tcctcgtcct 120 
ccgcgagccg gccgagctcc agccggagct 180 
tgggcagcgc gctgcgcccc agcaccagcc 240 
tgtatgccae gcgctcctct gccgtgcgcc 300 
tgcaggactc ggtggacttc tcgctggccg 360 
gcaccaacga g 401 



<400> 275 

ccacttccac cactttgtgg agcagtgcct 
ctggcctggg cctgggcttc gggagagcag 
gaagggactt acctcccaaa ggttctgcag 
agctcctggg tgtgtcagag gccagcctgg 
agggagaggg agaggggacc cgaggctgag 
gacacggcag tgatgctgcg gtctctcctc 
ctcctgaacc actctttctt caagcagatc 

<210> 276 
<211> 401 
<212> DNA 

<213> Homo sapiens 



tcagcgcaac ccggatgcca ggtatccctg 60 
agggtgctca ggagggtaag gccagggtgt 120 
gggaatctgg agctacacac aggagggatc 180 
ggagctctgg ccactgcttc ccatgagctg 240 
gcataagtgg caggatttcg ggaagctggg 300 
ccctttccct ccaggcccag tgccagcacc 360 
aagcgacgtg c , 40 i 



<220> 
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<221> misc_feature 
<222> 11 

"<223> n = A,T,C or G 



<400> 276 

tctgatattg ntacccttga gccacctaag 
attgttgaag aagcacagag ttcagaagac 
tatactttct gtcagccaga aactgtattt 
agtgatgaaa ccagtaatca gcccagtcct 
accgtttctg cttcagaatc tgaagaccgg 
aaggagttga gtaaacgtca gttcagtagt 
gtgattgcaa tcagcatggg atttggccat 

<210> 277 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> miscfeature 
<222> 227, 333 
<223> n = A,T,C or G 



ttagaagaaa ttggaaatca agaagttgtc 60 

tttaacatgg gctcttcctc tagcagccag 120 

t cat ct cage ctagtgatga tgaatcaagt 180 

gectttagae gacgccgtgc taggaagaag 240 

ctagttggtg aacaagaaac tgaaccttct 300 

ggtctcaata agtgtgttat acttgetttg 360 

ttctatggca c 401 



<400> 277 

aactttggca acatatctca gcaaaaacta 
tgtgcagagg agtggctgca atgaggtcac 
gtcctcatca cccatccctc gaactcaagt 
tccacacatc ctgccccatc aagatgttct 
gatgettett gaaaattget tagttgaaaa 
acagtgggaa gagaggctgc aggaacagcg 
cgggcgcacc agtcgtagta atccccccaa 

<210> 278 
<2U> 401 
<212> DNA 
<213> Homo sapiens 



cagctatgtt attcatgeca aaataaaagc 60 
aacggtggtg gatgtaaaag agatcttcaa 120 
cccgctcatt acaaattctt ettgecagtg 180 
catcatgtgt tacgagnggc gctcaaggat 240 
atggagagat cagcttagta aaagatccat 300 
ganaacagtt caggacaaga agaaaacagc 360 
accaaaggga a 401 



<220> 

<221> misc_feature 
<222> 322, 354 
<223> n - A,T,C or G 



<400> 278 

aatgagtgtg agaccacaaa tgaatgeegg 
ggcttccgtt gttatccacg aaatccttgt 
cgatgtgttt gcccagtctc aaatgccatg 
aaatacatga gcatccgatc tgataggtct 
acaactattt atgccaacac catcaatact 
gagtctacct acgacaacaa anccctgtaa 
caggaccaag agaacatatc gtggacctgg 

<210> 279 
<211> 401 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc feature 



gaggatgaaa tgtgttggaa ttatcatggc 60 
caagatccct acattctaac accagagaac 120 
tgecgagaac tgccccagtc aatagtctac 180 
gtgccatcag acatcttcca gatacaggee 240 
ttteggatta aatctggaaa . tgaaaatgga 300 
gtgeaatget tgtgctcgtg aagncattat 360 
agatgetgae a 401 
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<222> 30, 35,. 81, 88, 180, 212, 378, 384, 391 
<223> n = A,T,C or G 

<400> 279 

aaattattgc ctctgataca tacctaagtn aacanaacat taatacctaa gtaaacataa 60 
cattacttgg agggttgcag nttctaantg aaactgtatt tgaaactttt aagtatactt 120 
taggaaacaa gcatgaacgg cagtctagaa taccagaaac atctacttgg gtagcttggn 180 
gccattatcc tgtggaatct gatatgtctg gnagcatgtc attgatggga catgaagaca 240 
tctttggaaa tgatgagatt atttcctgtg ttaaaaaaaa aaaaaatctt aaattcctac 300 
aatgtgaaac tgaaactaat aattttgatc ctgatgtatg ggacagcgta tctgtaccag 360 
gctctaaata acaaaagnta gggngacaag nacatgttcc t 401 

<210> 280 
<211> 326 
<212> DNA 
<213> Homo sapiens 

<400> 280 

gaagtggaat tgtataattc aattcgataa ttgatctcat gggctttccc tggaggaaag 60 

gttttttttg ttgttttttt tttaagaact tgaaacttgt aaactgagat gtctgtagct 120 

tttttgccca tctgtagtgt atgtgaagat ttcaaaacct gagagcactt tttctttgtt 180 

tagaattatg agaaaggcac tagatgactt taggatttgc atttttccct ttattgcctc 240 

atttcttgtg acgccttgtt ggggagggaa atctgtttat tttttcctac aaataaaaag 300 
ctaagattct atatcgcaaa aaaaaa ^26 

<210> 281 
<211> 374 
<212> DNA 
<213> Homo sapiens 

<400> 281 

caacgcgttt gcaaatattc ccctggtagc ctacttcctt acccccgaat attggtaaga 60 
tcgagcaatg gcttcaggac atgggttctc ttctcctgtg atcattcaag tgctcactgc 120 
atgaagactg gcttgtctca gtgtttcaac ctcaccaggg ctgtctcttg gtccacacct 180 
cgctccctgt tagtgccgta tgacagcccc catcaaatga ccttggccaa gtcacggttt 240 
ctctgtggtc aaggttggtt ggctgattgg tggaaagtag ggtggaccaa aggaggccac 300 
gtgagcagtc agcaccagtt ctgcaccagc agcgcctccg tcctagtggg tgttcctgtt 360 
tctcctggcc ctgg " 374 

<210> 282 
<211> 404 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 26, 27, 51, 137, 180, 222 

<223> n = A,T,C or G 

<400> 282 

agtgtggtgg aattcccgca tcctanncgc cgactcacac aaggcagagt ngccatggag 60 
aaaattccag tgtcagcatt cttgctcctt gtggccctct cctacactct ggccagagat 120 
accacagtca aacctgnagc caaaaaggac acaaaggact ctcgacccaa actgccccan 180 
accctctcca gaggttgggg tgaccaactc atctggactc anacatatga agaagctcta 240 
tataaatcca agacaagcaa caaacccttg atgattattc atcacttgga tgagtgccca 300 
cacagtcaag ctttaaagaa agtgtttgct gaaaataaag aaatccagaa attggcagag 360 
cagtttgtcc tcctcaatct ggtttatgaa acaactgaca aaca 404 
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<210> 283 
<211> 184 
V212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 26 

<223> n = A,T,C or G 
<400> 283 

agtgtggtgg aattcacttg cttaanttgt gggcaaaaga gaaaaagaag gattgatcag 60 
agcattgtgc aatacagttt cattaactcc ttccctcgct cccccaaaaa tttgaatttt 120 
tttttcaaca ctcttacacc tgttatggaa aatgtcaacc tttgtaagaa aaccaaaata 180 
aaaa 184 

<210> 284 
<211> 421 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature. 
<222> 147, 149 
<223> n = A, T,C or G 

<400> 284 

ctattaatcc tgccacaata tttttaatta 
cccatttcac ccactgctct gtttggccgc 
gaggcggata ccctttcctc ggggaanana 
aaaaatgttg gaaagtcgag tggcaaagct 
gtcaaaagat ccagggtgcc tctctctgtt 
acctccagtc accatacaca ggttaccagt 
agtctctaaa tcaatctgaa tggtatcatt 

g 

<210> 285 
<211> 361 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc__feature 

<222> 34, 188 

<223> n ■ A,T,C or G 



cgtacaaaga tctgacatgt cacccaggga 60 
cagtcttttg tctctctctt cagcaatggt 120 
aatccatggt ttgttgccct tgccaataac 180 
gttgccattg gcatctttca cgtgaaccac 240 
ggtgatcaca ccaattcttc ctaggttagc 300 
gtcgaacttg atgaaatcag taatcttgcc 360 
caccttgatg aggggatcgg ggtagcggat 420 



<400> 285 

ctgggtggta actctttatt tcattgtccg gaanaaagat gggagtggga acagggtgga 60 
cactgtgcag gcttcagctt ccactccggg caggattcag gctatctggg accgcaggga 120 
ctgccaggtg cacagccctg gctcccgagg caggcaggca aggtgacggg actggaagcc 180 
cttttcanag ccttggagga gctggtccgt ccacaagcaa tgagtgccac t^ctgcagttt 240 
gcaggggatg gataaacagg gaaacactgt gcattcctca cagccaacag. tgtaggtctt 300 
ggtgaagccc cggcgctgag ctaagctcag gctgttccag ggagccacga aactgcaggt 360 
a 361 



<210> 286 
<211> 336 
<212> DNA 
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<213> Homo sapiens 

4 <220> 
<221> misc_feature 
<222> 40, 68, 75, 127, 262 
<223> n - A,T r 'c or G 

<400> 286 

tttgagtggc agcgccttta tttgtggggg ccttcaaggn agggtcgtgg ggggcagcgg 60 
ggaggaanag ccganaaact gtgtgaccgg ggcctcaggt ggtgggcatt gggggctcct 120 
cttgcanatg cccattggca tcaccggtgc agccattggt ggcagcgggt . accggtcctt 180 
tcttgttcaa catagggtag gtggcagcca cgggtccaac tcgcttgagg ctgggccctg 240 
ggcgctccat tttgtgttcc angagcatgt ggttctgtgg cgggagcccc acgcaggccc 300 
tgaggatgtt ctcgatgcag ctgcgctggc ggaaaa " ^36 

<210> 287 
<211> 301 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 15, 33, 44, 53, 
215, 241, 246 
<223> n = A,T,C or G 

<400> 287 

tgggtaccaa atttntttat 
ttggtacaac ttatanaaaa 
cagggaagtc accccacggc 
gtctcccagg gngngcttgt 
nttgcncaag ttggtcacgt 

g 

<210> 288 
<211> 358 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 39, 143, 226 
<223> n — A, T, C or G 

<400> 288 

aagtttttaa actttttatt tgcatattaa aaaaattgng cattccaata attaaaatca 60 

tttgaacaaa aaaaaaaatg gcactctgat taaactgcat tacagcctgc aggacacctt 120 

gggccagctt ggttttactc tanatttcac tgtcgtccca ccccacttct tccaccccac 180 

ttcttccttc accaacatgc aagttctttc cttccctgcc agccanatag atagacagat 240 

gggaaaggca ggcgcggcct tcgttgtcag tagttctttg atgtgaaagg ggcagcacag 300 

tcatttaaac ttgatccaac ctctttgcat cttacaaagt taaacagcta slaagaagt 358 

<210> 289 
<211> 462 
<212> DNA 
<213> Homo sapiens 



76, 83, 107, 117, 154, 166, 192, 194, 207, 



ttgaaggaat ggnacaaatc aaanaactta agnggatgtt 60 
ggnaaaggaa accccaacat gcatgcnctg ccttggngac 120 
tatggggaaa ttancccgag gcttancttt cattatcact 180 
caaaaanata ttccnccaag ccaaattcgg gcgctcccat 240 
ggtcacccaa ttctttgatg gctttcacct gctcattcag 300 

301 



<220> 
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<221> mis cofeature 
<222> 87, 141, 182, 220, 269, 327 
X223> n = A,T,C or G 



<400> 289 

ggcatcagaa 

gcagacaaca 

ggctgaggga 

anatgaggac 

gagcttgagc 

tcctctccca 

ccaatcctgg 

ctcccagttg 



atgctgttta 
gatgcatagt 
ggaagggtaa 
aaagggactc 
aggccccagg 
ttccctgtct 
cccttgccag 
gattaggacg 



tttctctgct 
tgggganaaa 
naggaaggaa 
ccaagccccc 
gagcctcana 
gcttcanacc 
cttgacagtc 
tcgccctgtt 



gctcccaagc 
gggaggacag 
ggccatcctg 
aaatcatcan 
gccataccag 
acctcccagc 
acagtgcctg 
agcatgctgc 



tggctggcct 
gttccaggat 
gatccccaca 
aaaacaccaa 
ccactgtcta 
taagccccag 
gaattccacc 
cc 



ttgcagagga 60 
agagggtgca 120 
tttcagtctc 180 
ggagcaggag 240 
cttcccatcc 300 
ctccattccc 360 
actgaggctt 420 
462 



<210> 290 
<211> 481 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 44, 57, 122, 158, 
<223> n =* A,T,C or G 



304, 325, 352, 405 



<400> 290 

tactttccta aactttatta aagaaaaaag caataagcaa tggnggtaaa tctctanaac 60 
atacccaatt ttctgggctt cctcccccga gaatgtgaca ttttgatttc caaacatgcc 120 
anaagtgtat ggttcccaac tgtactaaag taggtganaa gctgaagtcc tcaagtgttc 180 
atcttccaac ttttcccagt ctgtggtctg tctttggatc agcaataatt gcctgaacag 240 
ctactatggc ttcgttgatt tttgtctgta gctctctgag ct octet atg tgeagcaate 300 
gcanaatttg agcagcttca ttaanaactg catctcctgt gtcaaaacca anaatatgtt 360 
tgtctaaagc aacaggtaag ccctcttttg tttgatttgc cttancaact gcatcctgtg 420 
tcaggcgctc ctgaaccaaa atccgaattg ccttaagcat taccaggtaa tcatcatgac 480 
g 481 

<210> 291 
<211> 381 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> 79, 166, 187, 208, 219, 315 

<223> n = A,T,C or G 



<400> 291 

tcatagtaat gtaaaaccat ttgtttaatt 
attagtgact ggttaaggng tgccactgta 
cctggtccta gtccacaagg gtggcaggag 
acaaaanaaa ggaaagctgc cttggcanaa 
tgggaagggg gctccctgtt ggggecgage 
cttagctcct ggcanagggt gagtggggac 
ccagcctggc tttactaaca g 

<210> 292 
<211> 371 
<212> DNA 

<213> Homo sapiens 



ctaaatcaaa tcactttcac aacagtgaaa 60 
catatcatca ttttctgact ggggtcagga 120 
gagggtggag gctaanaaca cagaaaacac 180 
ggatgaggng gtgagcttgc cgaaggatgg 240 
caggagtccc aagtcagctc t^cctgcctta 300 
ctacgaggtt caaaatcaaa tggcatttgg 360 

381 
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<220> 

<<221> mis cofeature 
<222> 32, 55, 72, 151, 189, 292 . 
<223> n = A,T,Cor6 



<400> 292 

gaaaaaataa tccgtttaat tgaaaaacct gnaggatact attccactcc cccanatgag 60 
gaggctgagg anaccaaacc cctacatcac ctcgtagcca cttctgatac tcttcacgag 120 
gcagcaggca aagacaattc ccaaaacctc nacaaaagca attccaaggg ctgctgcagc 180 
taccaccanc acatttttcc tcagccagcc cccaatcttc tccacacagc cctccttatg 240 
gatcgccttc tcgttgaaat taatcccaca gcccacagta acattaatgc ancaggagtc 300 
ggggactcgg ttcttcgaca tggaagggat tttctcccaa tctgtgtagt tagcagcccc 360 
acagcactta a 371 

<210> 293 
<211> 361 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 75, 196, 222 
<223> n — A, T,C or G 

<400> 293 

gatttaaaag aaaacacttt attgttcagc aattaaaagt tagccaaata tgtatttttc 60 

tccataattt attgngatgt tatcaacatc aagtaaaatg ctcattttca tcatttgctt 120 

ctgttcatgt tttcttgaac acgtcttcaa ttttccttcc aaaatgctgc atgccacact 180 

tgaggtaacg aagcanaagt atttttaaac atgacagcta anaacattca tctacagcaa 240 

cctatatgct caatacatgc cgcgtgatcc tagtagtttt ttcacaacct tctacaagtt 300 

tttggaaaac atctgttatg atgactttca tacaccttca cctcaaaggc tttcttgcac 360 
c 361 

<210> 294 

<211> 391 ' 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__f eature 

<222> 26, 77, 96, 150, 203, 252, 254, 264, 276 
<223> n « A,T,C or G 



<400> 294 

tattttaaag tttaattatg attcanaaaa 
atattgactc tgtatanacc acagttattg 
tattttttat tctgaaaatg atattaatan 
tacatatgcc caaaatggct ganaataaat 
agggcatgca ananaaaatc tcanaatacc 
atttgaagtt atttcagtca tctttgtctt 
cgatgtaatt gaaattcccc tttttatcaa 

<210> 295 
<211> 343 
<212> DNA 
<213> Homo sapiens 



aatcgagcga ataactttct ctgaaaaaat 60 

gggganaagg gctggtaggt taaattatcc 120 s 

aaagtcccgt ttccagtctg attataaaga 180 

acaacaggaa atgcaaaagc tgtaaagcta 240 

caaagnggca acaaggaacg tttggctgga 300 

tggctccatg tttcaggatg qgtgtgaact 360 

t 391 
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<220> 

<221> misc_feature 
«222> 145, 174, 205, 232 
<223> n = A,T,C or G 



<400> 295 

ttcttttgtt ttattgataa cagaaactgt 
aaataataaa gaatgtgtct actgccagca 
acaaatatag agttcttcac accanatggc 
aattgtgctt ctacaaaacc ttcanagcat 
cacatttcca ttattacact tttagtgagc 
tctttcacaa aagccaagcc tcatttacaa 



gcataattac agatttgatg aggaatctgc 60 
aaatacaatt attccatgcc ctctcaacat 120 
tctggtgtaa caaagccatt ttanatgttt 180 
gaggtagttt cttttaccta cnatattttc 240 
taaaatcctt ttaacatagc ctgcggatga 300 
agggtttatt tct 343 



<210> 296 
<211> 241 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> mis cofeature 
<222> 96, 98, 106, 185 
<223> n « A,T,C or G 



<400> 296 

ttcttggata ttggttgttt ttgtgaaaaa 
tatttctcta ctttgccctc ctgatgccca 
cttccacttt ggaaaaaaaa aaaacctgtt 
gatanatcgc tctcaaaatc taaggctctg 
t 

<210> 297 
<211> 391 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> 12, 130 

<223> n = A, T,C or G 



gtttttgttt ttcttctcag tcaactgaat 60 
catgananaa cttaanataa tttctaacag 120 
ttcctcatgg aaccccagga gttgaaagtg 180 
ttcagcttta cattatgtta cctgacgttt 240 

241 



<400> 297 

gttgtggctg anaatgctgg agatgctcag 
cttggtggtg ccctcacatc tggggtcttc 
gtcaggacan accatgtccg tgctaggccc 
ctctcccagg tttctggtcc cgatgggcaa 
catcccacta cccctcacat gctctcactc 
ttcacgaact ctcaaagaaa aggaaggata 
tggaaaagta caaaaagaca gccagaggtg 



ttctctccct cacaaggtag gccacaaatt 60 
aggcaccagc catgcctgcc gaggagtgct 120 
aggcacagcc caaccactcc tcatccaagt 180 
ggatgacccc tccagtggct ggtaccccac 240 
tccatcaggt ccccaatcct ggcttccctc 300 
aaacctaaat aaaccagaca gaagcagctc 360 
t 391 



<210> 298 
<211> 321 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc__feature 

<222> 14, 30, 76, 116, 201, 288, 301 
<223> n - A,T,C or G 
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<400> 298 

^caagccaaac tgtntccagc tttattaaan 
ggcaggacat gggcanacaa tcgttaacag 
atggactacc aaaaatcaaa aagccactat 
tgaacaggga aagtttaaag ngagggttga 
ttttcacaag ccgaccctga ctttcaggaa 
natccacaat ctaaaaatgg a 

<210> 299 
<211> 401 
<212> DNA 

<213> Homo sapiens , ; 
<220> 

<221> misc_feature 
<222> 104, 268, 347 
<223> n = A, T, C or G 



atactttcca taaacaatca tggtatttca 60 
tatacaacaa ctttcaaact cccttnttca 120 
aaaacccaat gaagtcttca tctgatgctc 180 
catttcacat ttagcatgtt gtttaacaac 240 
gtgaaatgaa aatggcanaa tttatctgaa 300 

321 



<400> 299 

tatcataaag agtgttgaag tttatttatt 
attggtaaaa aaataaaaca aaaagcattt 
agaagtatca tttttctttg tcaaattata 
tggaattttg tcggtcactt gcactggttg 
agttaaattt tttttgttgg gatttcanat 
caacgtccac accaaattct tgatcaggac 
taggtagtct caca'gccttg cgtgttcgat 

<210> 300 
<211> 188 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 48 

<223> n = A,T,C or G 



atagcaccat tgagacattt tgaaattgga 00 
gaattgtatt tggnggaaca gcaaaaaaag 120 
ctgtttccaa acattttgga aataaataac 180 
acaagattag aacaagagga acacatatgg 240 
agagtttggt ttataaaaag caaacagggc 300 
caccaatgtc atagggngca atatctacaa 360 
attcaaagac t 401 



<400> 300 

tgaatgcttt gtcatattaa gaaagttaaa gtgcaataat gtttgaariac aataagtggt 60 
ggtgtatctt gtttctaata agataaactt ttttgtcttt gctttatctt attagggagt 120 
tgtatgtcag tgtataaaac atactgtgtg gtataacagg cttaataaat tctttaaaag 180 
gaaaaaaa 



<210> 301 
<211> 291 
<212> DNA 
<213> Homo sapiens 



<400> 301 

aagattttgt tttattttat tatggctaga aagacactgt tatagccaaa atcggcaatg 60 

acactaaaga aatcctctgt gcttttcaat atgcaaatat atttcttcca a^agttgccc 120 

tggtgtgact tcaagagttc atgttaactt cttttctgga aacttccttt tcttagttgt 180 

tgtattcttg aagagcctgg gccatgaaga gcttgcctaa gttttgggca gtgaactcct 240 

tgatgttctg gcagtaagtg tttatctggc ctgcaatgag cagcgagtcc a 291 



<210> 302 
<211> 341 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 25 

<223> n = A,T,C or G 



<400> 302 

tgatttttca taattttatt aaatnatcac 

attacactac aatctgatag gagtggtaaa 

aaacgccacc ttttattgtc ctgtcttatt 

ttcatgagcc agcagtggac ttgagttaca 

gaagaagcca tcaaattctt gaggacttga 

cccccgggct gcaggaattc gatatcaagc 

<210> 303 
<211> 361 
<212> DNA 
<213> Homo sapiens 



tgggaaaact aatggttcgc gtatcacaca 60 
accagccaat ggaatccagg taaagtacaa 120 
tctcgggaag gagggttcta ctttacacat 180 
atgtgtaggt tccttgtggt tatagctgca 240 
catctctcgg aaagaagcaa actagtggat 300 
ttatcgatac c 341 



<220> 

<221> misc_feature 

<222> 15 f 27, 92, 124, 121, 183, 

<223> n=A,T,CorG 

<400> 303 

tgcagacagt aaatnaattt tatttgngtt 

gctccgtgac agcccaccaa cccccaaccc 

caanaanatg gaaggatctc acggatctca 

tanacagacg gagttganat gctggaggat 

ccanacttca tcccagccgg gacgtcctcc 

actttgccgc agttccaggn gtcctgcttc 
a 



198, 244, 320 



cacagaacat actaggcgat ctcgacagtc 60 
tntacctcgc agccacccta aaggcgactt 120 
ttcctaatgg tccgccgaag tctcacacag 180 
gcagtcacct cctaaactta cgacccacca 240 
cccacccgag tcctccccat ttcttctcct 300 
caccagtccc acaaagctca ataaatacca 360 

361 



<210> 304 
<211> 301 
<212> DNA 
<213> Homo sapiens 



<220> 

<221> misc_feature 
<222> 23, 104, 192 
<223> n — A, T,C or G 



<400> 304 

ctctttacaa cagcctttat ttncggccct 
tagctccgcc cgccaggctc tgtgccgcct 
ctcaggggct tgaggccgta ctcccccagc 
aaggtcagcc anaacaggtc gtcctgcaca 
tgggccacgg tctgcgtcag ccgcacctcg 
a 

<210> 305 
<211> 331 
<212> DNA 
<213> Homo sapiens 



tgatcctgct cggatgctgg tggaggccct 60 
ccccgcaggc gcanattcat gaacacggtg 120 
gggagctggt cctccagggg cttcccctcg 180 
ccctccagcc cgctcacttg ctgcttcagg 240 
taggtgctgc tgcggccctt gttattcctc 300 

1 301 
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<220> 

<221> misc_feature 
,<222> 3, 36, 60, 193, 223 
<223> n = A,T,C or G 

<400> 305 

ganaggctag taacatcagt tttattgggt tggggnggca accatagcct ggctgggggn 60 
ggggctggcc ctcacaggtt gttgagttcc agcagggtct ggtccaaggt ctggtgaatc 120 
tcgacgttct cctccttggc actggccaag gtctcttcta ggtcatcgat ggttttctcc 180 
, aactttgcca canacctctc ggcaaactct gctcgggtct cancctcctt cagcttctcc 240 
tccaacagtt tgatctcctc ttcatattta tcttctttgg gggaatactc ctcctctgag 300 
gccatcaggg acttgagggc ctggtccatg g 33! 

<210> 306 
<211> 457 
<212> DNA 
<213> Homo sapiens 

<400> 306 

aatatgtaaa ggtaataact tttattatat taaagacaat gcaaacgaaa aacagaattg 60 
agcagtgcaa aatttaaagg actgttttgt tctcaaagtt gcaagtttca aagccaaaag 120 
aattatatgt atcaaatata taagtaaaaa aaagttagac tttcaagcct gtaatcccag 180 
cactttggga ggctgaggca ggtggatcac taacattaaa aagacaacat tagattttgt 240 
cgatttatag caattttata aatatataac tttgtcactt ggatcctgaa gcaaaataat 300 
aaagtgaatt tgggattttt gtacttggta aaaagtttaa caccctaaat tcacaactag 360 
tggatccccc gggctgcagg aattcgatat caagcttatc gataccgtcg acctcgaggg 420 
ggggcccggt acccaattcg ccctatagtg agtcgta 457 

<210> 307 
<211> 491 
<212> DNA 
<213> Homo sapiens 

<400> 307 

gtgcttggac ggaacccggc gctcgttccc caccccggcci ggccgcccat agccagccct 60 
ccgtcacctc ttcaccgcac cctcggactg ccccaaggcc cccgccgccg ctccagcgcc 120 
gcgcagccac cgccgccgcc gccgcctctc cttagtcgcc gccatgacga ccgcgtecac 180 
ctcgcaggtg cgccagaact accaccagga ctcagaggcc gccatcaacc gccagatcaa 240 
cctggagctc tacgcptcct acgtttacct gtccatgtct tactactttg accgcgatga 300 
tgtggctttg aagaactttg ccaaatactt tcttcaccaa tctcatgagg agagggaaca 360 
tgctgagaaa ctgatgaagc tgcagaacca acgaggtggc cgaatcttcc ttcaggatat 420 
caagaaacca gactgtgatg actgggagag cgggctgaat gcaatggagt gtgcattaca 480 
tttggaaaaa a 491 

<210> 308 
<211> 421 
<212> DNA 
<213> Homo sapiens 

<400> 308 

ctcagcgctt cttctttctt ggtttgatcc tgactgctgt catggcgtgc cctctggaga 60 

aggccctgga tgtgatggtg tccaccttcc acaagtactc gggcaaagag ggtgacaagt 120 

tcaagctcaa caagtcagaa ctaaaggagc tgctgacccg ggagctgccc agcttcttgg 180 

ggaaaaggac agatgaagct gctttccaga agctgatgag caacttggac agcaacaggg 240 

acaacgaggt ggacttccaa gagtactgtg tcttcctgtc ctgcatcgcc atgatgtgta 300 

acgaattctt tgaaggcttc ccagataagc agcccaggaa gaaatgaaaa ctcctctgat 360 

gtggttgggg ggtctgccag ctggggccct ccctgtcgcc agtgggcact tttttttttc 420 
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<210> 309 
<211> 321 
<212> DNA 
<213> Homo sapiens 



<400> 309 

accaaatggc ggatgacgcc ggtgcagcgg 
tggggaaccg cggtggcttc cgcggaggtt 
/ gccgtggacg gggccggggc cgaggccgcg 
agtggatgcc cgtcaccaag ttgggccgct 
aggagatcta tctcttctcc ctgcccatta 
gggcctctct caaggatgag g 



gggggcccgg gggccctggt ggccctggga 60 
tcggcagtgg catccggggc cggggtcgcg 120 
gagctcgcgg aggcaaggcc gaggataagg 180 
tggtcaagga catgaagatc aagtccctgg 240 
aggaatcaga gatcattgat ttcttcctgg 300 

321 



<210> 310 
<211> 381 
<212> DNA 
<213> Homo sapiens 



<400> 310 

ttaaccagcc atattggctc aataaatagc 
tcagtgccta tttttcctgg aaactcaatt 
ctgttgtcat tttcattcgg tgacattctc 
cacatttttc atttatagat gtttgcatcc 
ctcattggat ggcttttttt tttttcctcc 
taatgtatgt ttacatctct ttgcaaattc 
aatgtaacaa catactgtga a 

<210> 311 
<211> 538 
<212> DNA 
<213> Homo sapiens 



ttcggtaagg agttaatttc cttctagaaa 60 
ttaaatagtc caattccatc tgaagccaag 120 
tcccatgaca cccagaaggg gcagaagaac 180 
tttgtattaa aattattttg aaggggttgc 240 
agggagaagg ggagaaatgt acttggaaat 300 
ctgtacatag agatatattt tttaagtgtg 360 

381 



<400> 311 

tttgaattta caccaagaac ttctcaataa 
cataccacaa gagaagttaa tttcttaaca 
accaagttct gatatctttt aaagacatag 
tgaaaatatc cttgttgtgt attaggtttt 
gtcatcagta ccctcctatt cagctcccca 
ggttttcttc ttatttttag ataattcaag 
tttatggtaa actcttttaa agaaaattta 
ttaaatcttt atcatagact ctgtacatat 
atcatcggtg ggatgacaga acaaacatat 

<210> 312 
<211> 176 
<212> DNA 
<213> Homo sapiens 



aagaaaatca tgaatgctcc acaatttcaa 60 
ttgtgttcta tgattatttg taagaccttc 120 
ttcaaaattg cttttgaaaa tctgtattct 180 
taaataccag ctaaaggatt acctcactga 240 
agatgatgtg tttttgctta ccctaagaga 300 
tgcttagata aattatgttt tctttaagtg 360 
atatgttata gctgaatctt tttggtaact 420 
gttcaaatta gctgcttgcc tgatgtgtgt 480 
ttatgatcat gaataatgtg ctttgtaa 538 



<400> 312 

ggaggagcag ctgagagata gggtcagtga 
tcatagaacc attgccttag aattattgta 
gttttgttct ttgtgaacat gggtattttg 



atgcggttca gcctgctacc tctcctgtct 60 
tgacacgttt tttgttggtt aagctgtaag 120 
aggggagggt ggagggagta gggaag 176 



<210> 313 
<211> 396 
<212> DNA 
<213> Homo sapiens 
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<400> 313 

.ccagcacccc caggccctgg gggacctggg ttctcagact gccaaagaag ccttgccatc 60 
tggcgctccc atggctcttg caacatctcc ccttcgtttt tgagggggtc atgccggggg 120 
agccaccagc ccctcactgg gttcggagga gagtcaggaa gggccaagca cgacaaagca 180 
gaaacatcgg atttggggaa cgcgtgtcaa tcccttgtgc cgcagggctg ggcgggagag 240 
actgttctgt tccttgtgta actgtgttgc tgaaagacta cctcgttctt gtcttgatgt 300 
gtcaccgggg caactgcctg ggggcgggga tgggggcagg gtggaagcgg ctccccattt 360 
tataccaaag gtgctacatc tatgtgatgg gtgggg 396 

<210> 314 

<211> 311 

<212> DNA 

<213> Homo sapiens 

<400> 314 

cctcaacatc ctcagagagg actggaagcc agtccttacg ataaactcca taatttatgg 60 
cctgcagtat ctcttcttgg agcccaaccc cgaggaccca ctgaacaagg aggccgcaga 120 
ggtcctgcag aacaaccggc ggctgtttga gcagaacgtg cagcgctcca tgcggggtgg 180 
ctacatcggc tccacctact ttgagcgctg cctgaaatag ggttggcgca tacccacccc 240 
cgccacggcc acaagccctg gcatcccctg caaatattta ttgggggcca tgggtagggg 300 
tttggggggc g 311 

<210> 315 
<211> 336 
<212> DNA 
<213> Homo sapiens 



<400> 315 

tttagaacat ggttatcatc caagactact 
aatccacatt cctcttgagt tctgcagctt 
cgtagaatca catgatctga ggaccattca 
gtcttccata aagttttgca tggagcaaac 
agccctctaa aagcataggg cttagcctgc 
gttttgtaaa cactatagca, tctgttaaga 

<210> 316 
<211> 436 
<212> DNA 
<213> Homo sapiens 



ctaccctgca acattgaact cccaagagca 60 
ctgtgtaaat agggcagctg tcgtctatgc 120 
tggaagctgc taaatagcct agtctgggga 180 
aaacaggatt aaactaggtt tggttccttc 240 
aggcttcctt gggctttctc tgtgtgtgta 300 
tccagt i 336 



<400> 316 

aacatggtct gcgtgcctta agagagacgc ttcctgcaga acaggacctg actacaaaga 60 
atgtttccat tggaattgtt ggtaaagact tggagtttac aatctatgat gatgatgatg 120 
tgtctccatt cctggaaggt cttgaagaaa gaccacagag aaaggcacag cctgctcaac 180 
ctgctgatga acctgcagaa aaggctgatg aaccaatgga acattaagtg ataagccagt 240 
ctatatatgt attatcaaat atgtaagaat acaggcacca catactgatg acaataatct 300 
atactttgaa ccaaaagttg cagagtggtg gaatgctatg ttttaggaat cagtccagat 360 
gtgagttttt tccaagcaac ctcactgaaa cctatataat ggaatacatt tttctttgaa 420 
agggtctgta taatca 436 

<210> 317 
<211> 196 
<212> DNA 
<213> Homo sapiens 

<400> 317 

tattccttgt gaagatgata tactattttt gttaagcgtg tctgtattta tgtgtgagga 60 
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gctgctggct ttfcagtgcgc gtgcacgtgg agagctggtg cccggagatt ggacggcctg 120 
atgctccctc ccctgccctg gtccagggaa gctggccgag ggtcctggct cctgaggggc 180 
atctgcccct ccccca 196 

<210> 318 
<211> 381 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> miscfeature 

<222> 8, 9, 102, 122, 167, 182, 193, 235, 253, 265, 266, 290, 321, 
378 

<223> n « A,T,C or G 



<400> 318 

gacgcttnng ccgtaacgat gatcggagac 
gccggggcgg tgctgaactt taagctgaaa 
tncagggagc ccaacacagg tgacaacatc 
cnaatcttca tcnccctgtg gaacatcttc 
tcttgaatcc cancgatgaa accannaact 
tccattcctg atgacttcaa naatgttttt 
tccaagctcg tggtgggngg a 



atcctgctgt tcgggacgtt gctgatgaat 60 
aagaaggaca cncagggctt tggggaggag 120 
cgggaattct tgctgancct cagatacttt 180 
atgatgttct gcatgattgt gctgntcggc 240 
cactttcccg ggatgccgan tctccattcc 300 
gaccaaaaaa ccgacaacct tcccagaaag 360 

381 



<210> 319 
<211> 506 
<212> DNA 
<213> Homo sapiens 



<400> 319 

ctaagcttta cgaatggggt gacaacttat 
tttgtaaata cctttgttat aattgatagg 
cctctgagca gtgtatgtca ggacttgttc 
ttatacaggt agagatgtat gcagatgtgt 
ccattgatgt atgcatctct tggctgtact 
ctttgctaat attttaatgg tatagatctg 
tctgttgctg tgtgtttcat tttaaattga 
actctgccaa tgcttttatc tagaggcgtg 
tcccaagaaa ggcaggatta catctt 



gataaaaact agagctagtg aattagccta 60 
atacatcttg gacatggaat tgttaagcca 120 
attaggttgg cagcagaggg gcagaaggaa 180 
ccatatatgt ccatatttac attttgatag 240 
ataagaacac attaattcaa tggaaataca 300 
ctaatgaatt ctcttaaaaa catactgtat 360 
gcattaaggg aatgcagcat ttaaatcaga 420 
ttgccatttt tgtcttatat gaaatttctg 480 

506 



<210> 320 
<211> 351 
<212> DNA 
<213> Homo sapiens 



<400> 320 

ctgacctgca ggacgaaacc atgaagagcc 
cggtagtaac tttgtgttat gaatcacatg 
tcattaacag gagaaatgca aataccttca 
tccaagagag gatccgagaa cgctctaagc 
atgactacag actttgcgaa cgctacgcca 
gctacttcag gaagcgccga gggaccaaat 



tgatccttct tgccatcctg gccgccttag 60 
aaagcatgga atcttatgaa cttaatccct 120 
tatcccctca gcagagatgg agagctaaag 180 
ctgtccacga gctcaatagg ^aagcctgtg 240 
tggtttatgg atacaatgct gcctataatc 300 
gagactgagg gaagaaaaaa a 351 



<210> 321 
<211> 421 
<212> DNA 
<213> Homo sapiens 



WO 02/00174 



135 



PCT7US01/21065 



<400> 321 ' 

ctcggaggcg ttcagctgct tcaagatgaa gctgaacatc tccttcccag ccactggctg 60 

.ccagaaactc attgaagtgg acgatgaacg caaacttcgt actttctatg agaagcgtat 120 

ggccacagaa gttgctgctg acgctctggg tgaagaatgg aagggttatg tggtccgaat 180 

cagtggtggg aacgacaaac aaggtttccc catgaagcag ggtgtcttga cccatggccg 240 

tgtccgcctg ctactgagta aggggcattc ctgttacaga ccaaggagaa ctggagaaag 300 

aaagagaaaa tcagttcgtg gttgcattgt ggatgcaaat ctgagcgttc tcaacttggt 360 

tattgtaaaa aaaggagaga aggatattcc tggactgact gatactacag tgcctcgccg 420 

c 421 

<210> 322 
<211> 521 
<212> DNA 
<213> Homo sapiens 

<400> 322 

agcagctctc ctgccacagc tcctcacccc ctgaaaatgt tcgcctgctc caagtttgtc 60 
tccactccct ccttggtcaa gagcacctca cagctgctga gccgtccgct atctgcagtg 120 
gtgctgaaac gaccggagat actgacagat gagagcctca gcagcttggc agtctcatgt 180 
ccccttacct cacttgtctc tagccgcagc ttccaaacca gcgccatttc aagggacatc 240 
gacacagcag ccaagttcat tggagctggg gctgccacag ttggggtggc tggttctggg 300 
gctgggattg gaactgtgtt tgggagcctc atcattggtt atgccaggaa cccttctctg 360 
aagcaacagc tcttctccta cgccattctg ggctttgccc tctcggaggc catggggctc 420 
ttttgtctga tggtagcctt tctcatcctc tttgccatgt gaaggagccg tctccacctc 480 
ccatagttct cccgcgtctg gttggccccg tgtgttcctt t 52 1 

<210> 323 
<211> 435 
<212> DNA 
<213> Homo sapiens 

<400> 323 

ccgaggtcgc acgcgtgaga cttctccgcc gcagacgccg ccgcgatgcg ctacgtcgcc 60 

tcctacctgc tggctgccct agggggcaac tcctccccca gcgccaagga catcaagaag 120 

atcttggaca gcgtgggtat cgaggcggac gacgaccggc tcaacaaggt tatcagtgag 180 

ctgaatggaa aaaacattga agacgtcatt gcccagggta ttggcaagct tgccagtgta 240 

cctgctggtg gggctgtagc cgtctctgct gccccaggct ctgcagcccc tgctgctggt 300 

tctgcccctg ctgcagcaga ggagaagaaa gatgagaaga aggaggagtc tgaagagtca 360 

gatgatgaca tgggatttgg cctttttgat taaattcctg ctcccctgca aataaagcct 420 

ttttacacat ctcaa * 435 

<210> 324 
<211> 521 
<212> DNA 
<213> Homo sapiens 

<400> 324 

aggagatcga ctttcggtgc ccgcaagacc agggctggaa cgccgagatc acgctgcaga 60 
tggtgcagta caagaatcgt caggccatcc tggcggtcaa atccacgcgg cagaagcagc 120 
agcacctggt ccagcagcag cccccctcgc agccgcagcc gcagccgcag ctccagcccc 180 
aaccccagcc tcagcctcag ccgcaacccc agccccaatc acaaccccag cctcagcccc 240 
aacccaagcc tcagccccag cagctccacc cgtatccgca tccacatcca catccacact 300 
ctcatcctca ctcgcaccca caccctcacc cgcacccgca tccgcaccaa ataccgcacc 360 
cacacccaca gccgcactcg cagccgcacg ggcaccggct tctccgcagc acctccaact 420 
ctgcctgaaa ggggcagctc ccgggcaaga caaggttttg aggacttgag gaagtgggac 480 
gagcacattt ctattgtctt cacttggatc aaaagcaaaa c ~ - . - 



<210> 325 
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<211> 451 
<212> DNA 
£213> Homo sapiens 

<400> 325 

attttcattt ccattaacct ggaagctttc atgaatattc tcttctttta aaacatttta €0 
acattattta aacagaaaaa gatgggctct ttctggttag ttgttacatg atagcagaga 120 
tatttttact tagattactt tgggaatgag agattgttgt cttgaactct ggcactgtac 180 
agtgaatgtg tctgtagttg tgttagtttg cattaagcat gtataacatt caagtatgtc 240 
atccaaataa gaggcatata cattgaattg tttttaatcc tctgacaagt tgactcttcg 300 
acccccaccc ccacccaaga cattttaata gtaaatagag agagagagaa gagttaatga 360 
acatgaggta gtgttccact ggcaggatga cttttcaata gctcaaatca atttcagtgc 420 
ctttatcact tgaattatta acttaatttg a 451 

<210> 326 
<211> 421 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 296 

<223> n = A,T,C or G 
<400> 326 

cgcggtcgta agggctgagg atttttggtc cgcacgctcc tgctcctgac tcaccgctgt 60 
tcgctctcgc cgaggaacaa gtcggtcagg aagcccgcgc gcaacagcca tggcttttaa 120 
ggataccgga aaaacacccg tggagccgga ggtggcaatt caccgaattc gaatcaccct 180 
aacaagccgc aacgtaaaat ccttggaaaa ggtgtgtgct gacttgataa gaggcgcaaa 240 
agaaaagaat ctcaaagtga aaggaccagt tcgaatgcct accaagactt tgagantcac 300 
tacaagaaaa actccttgtg gtgaaggttc taagacgtgg gatcgtttcc agatgagaat 360 
tcacaagcga ctcattgact tgcacagtcc ttctgagatt gttaagcaga ttacttccat 420 
c ^ 421 

<210> 327 
<211> 456 



<212> DNA 

<213> Homo sapiens 

<400> 327 

atcttgacga ggctgcggtg tctgctgcta 
cgacaagaag aagaaggacg ctggaaagtc 
atccgggggc aaggccaaaa agaagaagtg 
taacttagtc ttgtttgaca aagctaccta 
taaacttata accccagctg tggtctctga 
ggcagccctt caggagctcc ttagtaaagg 
tcaagtaatt tacaccagaa ataccaaggg 
atgaataggt ccaaccagct gtacatttgg 

<210> 328 
<211> 471 
<212> DNA 
<213> Homo sapiens 

<400> 328 

gtggaagtga catcgtcttt aaaccctgcg 
cagggaagac agggcgacct ggaagtccaa 
tgattatccg aaatgtttca ttgtgggagc 



ttctccgagc ttcgcaatgc cgcctaagga 60 
ggccaagaaa gacaaagacc cagtgaacaa 120 
gtccaaaggc aaagttcggg acaagctcaa 180 
tgataaactc tgtaaggaag ttcccaacta 240 
gagactgaag attcgaggct ccctggccag 300 
acttatcaaa ctggtttcaa agcacagagc 360 
tggagatgct ccagctgctg gtgaagatgc 420 
aaaaat 456 



tggcaatccc tgacgcaccg ccgtgatgcc 60 
ctacttcctt aagatcatcc aactattgga 120 
agacaatgtg ggctccaagc agatgcagca 180 
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gatccgcatg tqccttcgcg ggaaggctgt ggtgctgatg ggcaagaaca ccatgatgcg 240 
caaggccatc cgagggcacc tggaaaacaa cccagctctg gagaaactgc tgcctcatat 300 
ccgggggaat gtgggctttg tgttcaccaa ggaggacctc actgagatca gggacatgtt 360 
^gctggccaat aaggtgccag ctgctgcccg tgctggtgcc attgccccat gtgaagtcac 420 
tgtgccagcc cagaacactg gtctcgggcc cgagaagacc tcctttttcc a 471 

<210> 329 
<211> 278 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> misc_feature 
<222> 154, 204 
<223> n = A,T f C or G 

<400> 329 

gtttaaactt aagcttggta ccgagctcgg atccactagt ccagtgtggt ggaattctag 60 

aaattgagat gcccccccag gccagcaaat gttccttttt gttcaaagtc tatttttatt 120 

ccttgatatt tttctttttt tttttttttt ttgnggatgg ggacttgtga atttttctaa 180 

aggtgctatt taacatggga gganagcgtg tgcggctcca gcccagcccg ctgctcactt 24 0 

tccaccctct ctccacctgc ctctggcttc tcaggcct 278 

<210> 330 
<211> 338 
<212> DNA 
<213> Homo sapiens 

<400> 330 

ctcaggcttc aacatcgaat acgccgcagg ccccttcgcc ctattcttca tagccgaata 60 
cacaaacatt attataataa acaccctcac cactacaatc ttcctaggaa caacatatga 120 
cgcactctcc cctgaactct acacaacata ttttgtcacc aagaccctac ttctaacctc 180 
cctgttctta tgaattcgaa cagcataccc ccgattccgc tacgaccaac tcatacacct 240 
cctatgaaaa aacttcctac cactcaccct agcattactt atatgatatg tctccatacc 300 
cattacaatc tccagcattc cccctcaaac ctaaaaaa , 338 

<210> 331 

<211> 2820 

<212> DNA 

<213> Homo sapiens 

<400> 331 

tggcaaaatc ctggagccag aagaaaggac agcagcattg atcaatctta cagctaacat 60 
gttgtacctg gaaaacaatg cccagactca atttagtgag ccacagtaca cgaacctggg 120 
gctcctgaac agcatggacc agcagattcg gaacggctcc tcgtccacca gtccctataa 180 
cacagaccac gcgcagaaca gcgtcacggc gccctcgccc tacgcacagc ccagccccac 240 
cttcgatgct ctctctccat cacccgccat cccctccaac accgactacc caggcccgca 300 
cagttccgac gtgtccttcc agcagtcgag caccgccaag tcggccacct ggacgtattc 360 
cactgaactg aagaaactct actgccaaat tgcaaagaca tgccccatcc agatcaaggt 420 
gatgacccca cctcctcagg gagctgttat ccgcgccatg cctgtctaca aaaaagctga 480 
gcacgtcacg gaggtggtga agcggtgccc caaccatgag ctgagccgtg agttcaacga 540 
gggacagatt gcccctccta gtcatttgat tcgagtagag gggaacagcc atgcccagta 600 
tgtagaagat cccatcacag gaagacagag tgtgctggta ccttatgagc caccccaggt 660 
tggcactgaa ttcacgacag tcttgtacaa tttcatgtgt aacagcagtt gtgttggagg 720 
gatgaaccgc cgtccaattt taatcattgt tactctggaa accagagatg ggcaagtcct 780 
gggccgacgc tgctttgagg cccggatctg tgcttgccca ggaagagaca ggaaggcgga 840 
tgaagatagc atcagaaagc agcaagtttc ggacagtaca aagaacggtg atggtacgaa 900 
gcgcccgttt cgtcagaaca cacatggtat ccagatgaca tccatcaaga aacgaagatc 960 
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cccagatgat giactgttat acttaccagt gaggggccgt gagacttatg aaatgctgtt 1020 
gaagatcaaa gagtccctgg aactcatgca gtaccttcct cagcacacaa ttgaaacgta 1080 
caggcaacag caacagcagc agcaccagca cttacttcag aaacagacct caatacagtc 1140 
tccatcttca tatggtaaca gctccccacc tctgaacaaa atgaacagca tgaacaagct 1200 
gccttctgtg agccagctta tcaaccctca gcagcgcaac gccctcactc ctacaaccat 1260 
tcctgatggc atgggagcca acattcccat gatgggcacc cacatgccaa tggctggaga 1320 
catgaatgga ctcagcccca cccaggcact ccctccccca ctctccatgc catccacctc 1380 
ccactgcaca cccccacctc cgtatcccac agattgcagc attgtcagtt tcttagcgag 1440 
gttgggctgt tcatcatgtc tggactattt cacgacccag gggctgacca ccatctatca 1500 
gattgagcat tactccatgg atgatctggc aagtctgaaa atccctgagc aatttcgaca 1560 
tgcgatctgg aagggcatcc tggaccaccg gcagctccac gaattctcct ccccttctca 1620 
tctcctgcgg accccaagca gtgcctctac agtcagtgtg ggctccagtg agacccgggg 1680 
tgagcgtgtt attgatgctg tgcgattcac cctccgccag accatctctt tcccaccccg 1740 
agatgagtgg aatgacttca actttgacat ggatgctcgc cgcaataagc aacagcgcat 1800 
caaagaggag ggggagtgag cctcaccatg tgagctcttc ctatccctct cctaactgcc 1860 
agccccctaa aagcactcct gcttaatctt caaagccttc tccctagctc ctccccttcc 1920 
tcttgtctga tttcttaggg gaaggagaag taagaggcta cctcttacct aacatctgac 1980 
ctggcatcta attctgattc tggctttaag ccttcaaaac tatagcttgc agaactgtag 2040 
ctgccatggc taggtagaag tgagcaaaaa agagttgggt gtctccttaa gctgcagaga 2100 
tttctcattg acttttataa agcatgttca cccttatagt ctaagactat atatataaat 2160 
gtataaatat acagtataga tttttgggtg gggggcattg agtattgttt aaaatgtaat 2220 
ttaaatgaaa gaaaattgag ttgcacttat tgaccatttt ttaatttact tgttttggat 2280 
ggcttgtcta tactccttcc cttaaggggt atcatgtatg gtgataggta tctagagctt 2340 
aatgctacat gtgagtgcga tgatgtacag attctttcag ttctttggat tctaaataca 2400 
tgccacatca aacctttgag tagatccatt tccattgctt attatgtagg taagactgta 2460 
gatatgtatt cttttctcag tgttggtata ttttatatta ctgacatttc ttctagtgat 2520 
gatggttcac gttggggtga tttaatccag ttataagaag aagttcatgt ccaaacggtc 2580 
ctctttagtt tttggttggg aatgaggaaa attcttaaaa ggcccatagc agccagttca 2640 
aaaacacccg acgtcatgta tttgagcata tcagtaaccc ccttaaattt aatacccaga 2700 
taccttatct tacaatgttg attgggaaaa catttgctgc ccattacaga ggtattaaaa 2760 
ctaaatttca ctactagatt gactaactca aatacacatt tgctactgtt gtaagaattc 2820 



<210> 332 

<211> 2270 i 

<212> DNA 

<213> Homo sapiens 

<400> 332 

tcgttgatat caaagacagt tgaaggaaat gaattttgaa acttcacggt gtgccaccct 60 
acagtactgc cctgaccctt acatccagcg tttcgtagaa acccagctca tttctcttgg 120 
aaagaaagtt attaccgatc caccatgtcc cagagcacac agacaaatga attcctcagt 180 
ccagaggttt tccagcatat ctgggatttt ctggaacagc ctatatgttc agttcagccc 240 
attgacttga actttgtgga tgaaccatca gaagatggtg cgacaaacaa gattgagatt 300 
agcatggact gtatccgcat gcaggactcg gacctgagtg accccatgtg gccacagtac 360 
acgaacctgg ggctcctgaa cagcatggac cagcagattc agaacggctc ctcgtccacc 420 
agtccctata acacagacca cgcgcagaac agcgtcacgg cgccctcgcc ctacgcacag 480 
cccagctcca ccttcgatgc tctctctcca tcacccgcca tcccctccaa caccgactac 540 
ccaggcccgc acagtttcga cgtgtccttc cagcagtcga gcaccgccaa gtcggccacc 600 
tggacgtatt ccactgaact gaagaaactc tactgccaaa ttgcaaagac atgccccatc 660 
cagatcaagg tgatgacccc acctcctcag ggagctgtta tccgcgccat cjcctgtctac 720 
aaaaaagctg agcacgtcac ggaggtggtg aagcggtgcc ccaaccatga gctgagccgt 780 
gaattcaacg agggacagat tgcccctcct agtcatttga ttcgagtaga ggggaacagc 840 
catgcccagt atgtagaaga tcccatcaca ggaagacaga gtgtgctggt accttatgag 900 
ccaccccagg ttggcactga attcacgaca gtcttgtaca atttcatgtg taacagcagt 960 
tgtgttggag ggatgaaccg ccgtccaatt ttaatcattg ttactctgga aaccagagat 1020 
gggcaagtcc tgggccgacg ctgctttgag gcccggatct gtgcttgccc aggaagagac 1080 
aggaaggcgg atgaagatag catcagaaag cagcaagttt cggacagtac aaagaacggt 1140 
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gatggtacga atfcgcccgtt tcgtcagaac acacatggta tccagatgac atccatcaag 1200 
aaacgaagat ccccagatga tgaactgtta tacttaccag tgaggggccg tgagacttat 1260 
^gaaatgctgt tgaagatcaa agagtccctg gaactcatgc agtaccttcc tcagcacaca 1320 
attgaaacgt acaggcaaca gcaacagcag cagcaccagc acttacttca gaaacagacc 1380 
tcaatacagt ctccatcttc atatggtaac agctccccac ctctgaacaa aatgaacagc 1440 
atgaacaagc tgccttctgt gagccagctt atcaaccctc agcagcgcaa cgccctcact 1500 
cctacaacca ttcctgatgg catgggagcc aacattccca tgatgggcac ccacatgcca 1560 
atggctggag acatgaatgg actcagcccc acccaggcac tccctccccc actctccatg 1620 
ccatccacct cccactgcac acccccacct ccgtatccaa cagattgcag cattgtcggt 1680 
ttcttagcga ggttgggctg ttcatcatgt ctggactatt tcacgaccca ggggctgacc 1740 
accatctatc agattgagca ttactccatg gatgatctgg caagtctgaa aatccctgag 1800 
caatttcgac atgcgatctg gaagggcatc ctggaccacc ggcagctcca cgaattctcc 1860 
tccccttctc atctcctgcg gaccccaagc agtgcctcta cagtcagtgt gggctccagt 1920 
gagacccggg gtgagcgtgt tattgatgct gtgcgattca ccctccgcca gaccatctct 1980 
ttcccacccc gagatgagtg gaatgacttc aactttgaca tggatgctcg ccgcaataag 2040 
caacagcgca tcaaagagga gggggagtga gcctcaccat gtgagctctt cctatccctc 2100 
tcctaactgc cagcccccta aaagcactcc tgcttaatct tcaaagcctt ctccctagct 2160 
cctccccttc ctcttgtctg atttcttagg ggaaggagaa gtaagaggct acctcttacc 2220 
taacatctga cctggcatct aattctgatt ctggctttaa gccttcaaaa 2270 

<210> 333 

<211> 2816 

<212> DNA 

<213> Homo sapiens 

<400> 333 

tcgttgatat caaagacagt tgaaggaaat gaattttgaa acttcacggt gtgccaccct 60 
acagtactgc cctgaccctt acatccagcg tttcgtagaa acccagctca tttctcttgg 120 
aaagaaagtt attaccgatc caccatgtcc cagagcacac agacaaatga attcctcagt 180 
ccagaggttt tccagcatat ctgggatttt ctggaacagc ctatatgttc agttcagccc 240 
attgacttga actttgtgga tgaaccatca gaagatggtg cgacaaacaa gattgagatt 300 
agcatggact gtatccgcat gcaggactcg gacctgagtg accccatgtg gccacagtac 360 
acgaacctgg ggctcctgaa cagcatggac cagcagattc agaacggctc ctcgtccacc 420 
agtccctata acacagacca cgcgcagaac agcgtcacgg cgccctcg.cc ctacgcacag 480 
cccagctcca ccttcgatgc tctctctcca tcacccgcca tcccctccaa caccgactac 540 
ccaggcccgc acagtttcga cgtgtccttc cagcagtcga gcaccgccaa gtcggccacc 600 
tggacgtatt ccactgaact gaagaaactc tactgccaaa ttgcaaagac atgccccatc 660 
cagatcaagg tgatgacccc acctcctcag ggagctgtta tccgcgccat gcctgtctac 720 
aaaaaagctg agcacgtcac ggaggtggtg aagcggtgcc ccaaccatga gctgagccgt 780 
gaattcaacg agggacagat tgcccctcct agtcatttga ttcgagtaga ggggaacagc 840 
catgcccagt atgtagaaga tcccatcaca ggaagacaga gtgtgctggt accttatgag 900 
ccaccccagg ttggcactga attcacgaca gtcttgtaca atttcatgtg taacagcagt 960 
tgtgttggag ggatgaaccg ccgtccaatt ttaatcattg ttactctgga aaccagagat 1020 
gggcaagtcc tgggccgacg ctgctttgag gcccggatct gtgcttgccc aggaagagac 1080 
aggaaggcgg atgaagatag catcagaaag cagcaagttt cggacagtac aaagaacggt 1140 
gatggtacga agcgcccgtt tcgtcagaac acacatggta tccagatgac atccatcaag 1200 
aaacgaagat ccccagatga tgaactgtta tacttaccag tgaggggccg tgagacttat 1260 
gaaatgctgt tgaagatcaa agagtccctg gaactcatgc agtaccttcc tcagcacaca 1320 
attgaaacgt acaggcaaca gcaacagcag cagcaccagc acttacttca gaaacatctc 1380 
ctttcagcct gcttcaggaa tgagcttgtg gagccccgga gagaaactcc aaaacaatct 1440 
gacgtcttct ttagacattc caagccccca aaccgatcag tgtacccata gagccctatc 1500 
tctatatttt aagtgtgtgt gttgtatttc catgtgtata tgtgagtgtg tgtgtgtgta 1560 
tgtgtgtgcg tgtgtatcta gccctcataa acaggacttg aagacacttt ggctcagaga 1620 
cccaactgct caaaggcaca aagccactag tgagagaatc ttttgaaggg actcaaacct 1680 
ttacaagaaa ggatgttttc tgcagatttt gtatccttag accggccatt ggtgggtgag 1740 
gaaccactgt gtttgtctgt gagctttctg ttgtttcctg ggagggaggg gtcaggtggg 1800 
gaaaggggca ttaagatgtt tattggaacc cttttctgtc ttcttctgtt gtttttctaa 1860 
aattcacagg gaagcttttg agcaggtctc aaacttaaga tgtcttttta agaaaaggag 1920 
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aaaaaagttg tt'attgtctg tgcataagta agttgtaggt gactgagaga ctcagtcaga 1980 
cccttttaat gctggtcatg taataatatt gcaagtagta agaaacgaag gtgtcaagtg 2040 
^tactgctggg cagcgaggtg atcattacca aaagtaatca actttgtggg tggagagttc 2100 
tttgtgagaa cttgcattat ttgtgtcctc ccctcatgtg taggtagaac atttcttaat 2160 
gctgtgtacc tgcctctgcc actgtatgtt ggcatctgtt atgctaaagt ttttcttgta 2220 
catgaaaccc tggaagacct actacaaaaa aactgttgtt tggcccccat agcaggtgaa 2280 
ctcattttgt gcttttaata gaaagacaaa tccaccccag taatattgcc cttacgtagt 2340 
tgtttaccat tattcaaagc tcaaaataga atttgaagcc ctctcacaaa atctgtgatt 2400 
aatttgctta attagagctt ctatccctca agcctaccta ccataaaacc agccatatta 2460 
ctgatactgt tcagtgcatt tagccaggag acttacgttt tgagtaagtg agatccaagc 2520 
agacgtgtta aaatcagcac tcctggactg gaaattaaag attgaaaggg tagactactt 2580 
ttcttttttt tactcaaaag tttagagaat ctctgtttct ttccatttta aaaacatatt 2640 
ttaagataat agcataaaga ctttaaaaat gttcctcccc tccatcttcc cacacccagt 2700 
caccagcact gtattttctg tcaccaagac aatgatttct tgttattgag gctgttgctt 2760 
ttgtggatgt gtgattttaa ttttcaataa acttttgcat cttggtttaa aagaaa 2816 

<210> 334 

<211> 2082 

<212> DNA 

<213> Homo sapiens 

<400> 334 

agatgctaca gcgactgcac acccaggctg tatgatacag cctattgctc ccgggctgca 60 
aacctgtcca gcatgtgatg tggtgggata ctgaattgaa taccgaatac tgtaggcaat 120 
tgtaacacag tggtaagtct ttgtgtatct aaacatagct aaacaccaaa aggtatagta 180 
agaatatggt attataatct tatggaacta tcattgtata tgtggtttgt caaccagaat 240 
gtagttatac agcacaggac tgtgcttatg atgtgccaag cacagctctc agtactaact 300 
cctttaatct tcatatcaac cctaggaggt aacttcttaa gtagattcat attgtaaggg 360 
tctcggggtg ggggggttgg caaaatcctg gagccagaag aaaggacagc agcattgatc 420 
aatcttacag ctaacatgtt gtacctggaa aacaatgccc agactcaatt tagtgagcca 480 
cagtacacga acctggggct cctgaacagc atggaccagc agattcagaa cggctcctcg 540 
tccaccagtc cctataacac agaccacgcg cagaacagcg tcacggcgcc ctcgccctac 600 
gcacagccca gctccacctt cgatgctctc tctccatcac ccgccatccc ctccaacacc 660 
gactacccag gcccgcacag tttcgacgtg tccttccagc agtcgagcac cgccaagtcg 720 
gccacctgga cgtattccac tgaactgaag aaactctact gccaaattgc aaagacatgc 780 
cccatccaga tcaaggtgat gaccccacct cctcagggag ctgttatccg cgccatgcct 840 
gtctacaaaa aagctgagca cgtcacggag gtggtgaagc ggtgccccaa ccatgagctg 900 
agccgtgaat tcaacgaggg acagattgcc cctcctagtc atttgattcg agtagagggg 960 
aacagccatg cccagtatgt agaagatccc atcacaggaa gacagagtgt gctggtacct 1020 
tatgagccac cccaggttgg cactgaattc acgacagtct tgtacaattt catgtgtaac 1080 
agcagttgtg ttggagggat gaaccgccgt ccaattttaa tcattgttac tctggaaacc 1140 
agagatgggc aagtcctggg ccgacgctgc tttgaggccc ggatctgtgc ttgcccagga 1200 
agagacagga aggcggatga agatagcatc agaaagcagc aagtttcgga cagtacaaag 1260 
aacggtgatg gtacgaagcg cccgtctcgt cagaacacac atggtatcca gatgacatcc 1320 
atcaagaaac gaagatcccc agatgatgaa ctgttatact taccagtgag gggccgtgag 1380 
acttatgaaa tgctgttgaa gatcaaagag tccctggaac tcatgcagta ccttcctcag 1440 
cacacaattg aaacgtacag gcaacagcaa cagcagcagc accagcactt acttcagaaa 1500 
cagtgagtgt atcaacgtgt cattttagga ggcatgagtg acggtgactt tatttggatc 1560 
agcaataggg tgattgatga gcaatgtgga acataatggg agatagcaga ttgtcataga 1620 
ttcagatgac ctggtatggc aaccctcttt cagttgcaac cttttttacg tgtcttatta 1680 
taaccttccc ttcagaattc cacttatgtt ctgaaattaa atacaaacca tttctggtga 1740 
attacaaaga aactcacact aacagttctc ttctctatat gcctggtcca tacacactaa 1800 
cagtaagtac acactctatt tggtagtgat gtgtatattt gaaaacatga aatcttttct 1860 
catcccaatg gattgtctta taaatctcct gggatgcaca ctatccactt ttgggaataa 1920 
cactgtagac cagggatagc aaataggctt tactataata taaagtgact tgtttgaatg 1980 
ctgtaatgag aagaattctg agacctagtg catgataatt ggggaaatat ctgggtgcag 2040 
aaggataagg tagcatcatg ttgccgtatt ttagcatctc tg 2082 
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<210> 335 / 
<211> 4849 
, <212> DNA 
<213> Homo sapiens 

<400> 335 

cgttgatatc aaagacagtt gaaggaaatg aattttgaaa cttcacggtg tgccacccta 60 
cagtactgcc ctgaccctta catccagcgt ttcgtagaaa ccccagctca tttctcttgg 120 
aaagaaagtt attaccgatc caccatgtcc cagagcacac agacaaatga attcctcagt 180 
ccagaggttt tccagcatat ctgggatttt ctggaacagc ctatatgttc agttcagccc 240 
attgacttga actttgtgga tgaaccatca gaagatggtg cgacaaacaa gattgagatt 300 
agcatggact gtatccgcat gcaggactcg gacctgagtg accccatgtg gccacagtac 360 
acgaacctgg ggctcctgaa cagcatggac cagcagattc agaacggctc ctcgtccacc 420 
agtccctata acacagacca cgcgcagaac agcgtcacgg cgccctcgcc ctacgcacag 480 
cccagctcca ccttcgatgc tctctctcca tcacccgcca tcccctccaa caccgactac 540 
ccaggcccgc acagtttcga cgtgtccttc cagcagtcga gcaccgccaa gtcggccacc 600 
tggacgtatt ccactgaact gaagaaactc tactgccaaa ttgcaaagac atgccccatc 660 
cagatcaagg tgatgacccc acctcctcag ggagctgtta tccgcgccat gcctgtctac 720 
aaaaaagctg agcacgtcac ggaggtggtg aagcggtgcc ccaaccatga gctgagccgt 780 
gaattcaacg agggacagat tgcccctcct agtcatttga ttcgagtaga ggggaacagc 840 
catgcccagt atgtagaaga tcccatcaca ggaagacaga gtgtgctggt accttatgag 900 
ccaccccagg ttggcactga attcacgaca gtcttgtaca atttcatgtg taacagcagt 960 
tgtgttggag ggatgaaccg ccgtccaatt ttaatcattg ttactctgga aaccagagat 1020 
gggcaagtcc tgggccgacg ctgctttgag gcccggatct gtgcttgccc aggaagagac 1080 
aggaaggcgg atgaagatag catcagaaag cagcaagttt cggacagtac aaagaacggt 1140 
gatggtacga agcgcccgtt tcgtcagaac acacatggta tccagatgac atccatcaag 1200 
aaacgaagat ccccagatga tgaactgtta tacttaccag tgaggggccg tgagacttat 1260 
gaaatgctgt tgaagatcaa agagtccctg gaactcatgc agtaccttcc tcagcacaca 1320 
attgaaacgt acaggcaaca gcaacagcag cagcaccagc acttacttca gaaacagacc 1380 
tcaatacagt ctccatcttc atatggtaac agctccccac ctctgaacaa aatgaacagc 1440 
atgaacaagc tgccttctgt gagccagctt atcaaccctc agcagcgcaa cgccctcact 1500 
cctacaacca ttcctgatgg catgggagcc aacattccca tgatgggcac ccacatgcca 1560 
atggctggag acatgaatgg actcagcccc acccaggcac tccctccccc actctccatg 1620 
ccatccacct cccagtgcac acccccacct ccgtatccca cagattgcag cattgtcagt 1680 
ttcttagcga ggttgggctg ttcatcatgt ctggactatt tcacgaccca ggggctgacc 1740 
accatctatc agattgagca ttactccatg gatgatctg$ caagtctgaa aatccctgag 1800 
caatttcgac atgcgatctg gaagggcatc ctggaccacc ggcagctcca cgaattctcc 1860 
tccccttctc atctcctgcg gaccccaagc agtgcctcta cagtcagtgt gggctccagt 1920 
gagacccggg gtgagcgtgt tattgatgct gtgcgattca ccctccgcca gaccatctct 1980 
ttcccacccc gagatgagtg gaatgacttc aactttgaca tggatgctcg ccgcaataag 2040 
caacagcgca tcaaagagga gggggagtga gcctcaccat gtgagctctt cctatccctc 2100 
tcctaactgc cagcycccta aaagcactcc tgcttaatct tcaaagcctt ctccctagct 2160 
cctccccttc ctcttgtctg atttcttagg ggaaggagaa gtaagaggct acctcttacc 2220 
taacatctga cctggcatct aattctgatt ctggctttaa gccttcaaaa ctatagcttg 2280 
cagaactgta gctgccatgg ctaggtagaa gtgagcaaaa aagagttggg tgtctcctta 2340 
agctgcagag atttctcatt gacttttata aagcatgttc acccttatag tctaagacta 2400 
tatatataaa tgtataaata tacagtatag atttttgggt ggggggcatt gagtattgtt 2460 
taaaatgtaa tttaaatgaa agaaaattga gttgcactta ttgaccattt tttaatttac 2520 
ttgttttgga tggcttgtct atactccttc ccttaagggg tatcatgtat ggtgataggt 2580 
atctagagct taatgctaca tgtgagtgac gatgatgtac agattctttc agttctttgg 2640 
attctaaata catgccacat caaacctttg agtagatcca tttccattgc ttattatgta 2700 
ggtaagactg tagatatgta ttcttttctc agtgttggta tattttatat tactgacatt 2760 
tcttctagtg atgatggttc acgttggggt gatttaatcc agttataaga agaagttcat 2820 
gtccaaacgt cctctttagt ttttggttgg gaatgaggaa aattcttaaa aggcccatag 2880 
cagccagttc aaaaacaccc gacgtcatgt atttgagcat atcagtaacc cccttaaatt 2940 
taataccaga taccttatct tacaatattg attgggaaaa catttgctgc cattacagag 3000 
gtattaaaac taaatttcac tactagattg actaactcaa atacacattt gctactgttg 3060 
taagaattct gattgatttg attgggatga atgccatcta tctagttcta acagtgaagt 3120 
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tttactgtct attaatattc agggtaaata ggaatcattc agaaatgttg agtctgtact 3180 
aaacagtaag atatctcaat gaaccataaa ttcaactttg taaaaatctt ttgaagcata 3240 
xjataatattg tttggtaaat gtttcttttg tttggtaaat gtttctttta aagaccctcc 3300 
tattctataa aactctgcat gtagaggctt gtttaccttt ctctctctaa ggtttacaat 3360 
aggagtggtg atttgaaaaa tataaaatta tgagattggt tttcctgtgg cataaattgc 3420 
atcactgtat cattttcttt tttaaccggt aagagtttca gtttgttgga aagtaactgt 3480 
gagaacccag tttcccgtcc atctccctta gggactaccc atagacatga aaggtcccca 3540 
cagagcaaga gataagtctt tcatggctgc tgttgcttaa accacttaaa cgaagagttc 3600 
ccttgaaact ttgggaaaac atgttaatga caatattcca gatctttcag aaatataaca 3660 
catttttttg catgcatgca aatgagctct gaaatcttcc catgcattct ggtcaagggc 3720 
tgtcattgca cataagcttc cattttaatt ttaaagtgca aaagggccag cgtggctcta 3780 
aaaggtaatg tgtggattgc ctctgaaaag tgtgtatata ttttgtgtga aattgcatac 3840 
tttgtatttt gattattttt tttttcttct tgggatagtg ggatttccag aaccacactt 3900 
gaaacctttt tttatcgttt ttgtattttc atgaaaatac catttagtaa gaataccaca 3960 
tcaaataaga aataatgcta caattttaag aggggaggga agggaaagtt tttttttatt 4020 
atttttttaa aattttgtat gttaaagaga atgagtcctt gatttcaaag ttttgttgta 4080 
cttaaatggt aataagcact gtaaacttct gcaacaagca tgcagctttg caaacccatt 4140 
aaggggaaga atgaaagctg ttccttggtc ctagtaagaa gacaaactgc ttcccttact 4200 
ttgctgaggg tttgaataaa cctaggactt ccgagctatg tcagtactat tcaggtaaca 4260 
ctagggcctt ggaaattcct gtactgtgtc tcatggattt ggcactagcc aaagcgaggc 4320 
acccttactg gcttacctcc tcatggcagc ctactctcct tgagtgtatg agtagccagg 4380 
gtaaggggta aaaggatagt aagcatagaa accactagaa agtgggctta atggagttct 4440 
tgtggcctca gctcaatgca gttagctgaa gaattgaaaa gtttttgttt ggagacgttt 4500 
ataaacagaa atggaaagca gagttttcat taaatccttt tacctttttt ttttcttggt 4560 
aatcccctaa aataacagta tgtgggatat tgaatgttaa agggatattt tttttctatt 4620 
atttttataa ttgtacaaaa ttaagcaaat gttaaaagtt ttatatgctt tattaatgtt 4680 
ttcaaaaggt attatacatg tgatacattt tttaagcttc agttgcttgt cttctggtac 4740 
tttctgttat gggcttttgg ggagccagaa gccaatctac aatctctttt tgtttgccag 4800 
gacatgcaat aaaatttaaa aaataaataa aaactaatta agaaataaa 4849 

<210> 336 

<211> 1386 

<212> DNA 

<213> Homo sapiens 

<400> 336 ' 

atgttgtacc tggaaaacaa tgcccagact caatttagtg agccacagta cacgaacctg 60 
gggctcctga acagcatgga ccagcagatt cagaacggct cctcgtccac cagtccctat 120 
aacacagacc acgcgcagaa cagcgtcacg gcgccctcgc cctacgcaca gcccagctcc 180 
accttcgatg ctctctctcc atcacccgcc atcccctcca acaccgacta cccaggcccg 240 
cacagtttcg acgtgtcctt ccagcagtcg agcaccgcca agtcggccac ctggacgtat 300 
tccactgaac tgaagaaact ctactgccaa attgcaaaga catgccccat ccagatcaag 360 
gtgatgaccc cacctcctca gggagctgtt atccgcgcca tgcctgtcta caaaaaagct 420 
gagcacgtca cggaggtggt gaagcggtgc cccaaccatg agctgagccg tgaattcaac 480 
gagggacaga ttgcccctcc tagtcatttg attcgagtag aggggaacag ccatgcccag 540 
tatgtagaag atcccatcac aggaagacag agtgtgctgg taccttatga gccaccccag 600 
gttggcactg aattcacgac agtcttgtac aatttcatgt gtaacagcag ttgtgttgga 660 
gggatgaacc gccgtccaat tttaatcatt gttactctgg aaaccagaga tgggcaagtc 720 
ctgggccgac gctgctttga ggcccggatc tgtgcttgcc caggaagaga caggaaggcg 780 
gatgaagata gcatcagaaa gcagcaagtt tcggacagta caaagaacgg tgatggtacg 840 
aagcgcccgt ttcgtcagaa cacacatggt atccagatga catccatcaa gaaacgaaga 900 
tccccagatg atgaactgtt atacttacca gtgaggggcc gtgagactta tgaaatgctg 960 
ttgaagatca aagagtccct ggaactcatg cagtaccttc ctcagcacac aattgaaacg 1020 
tacaggcaac agcaacagca gcagcaccag cacttacttc agaaacagac ctcaatacag 1080 
tctccatctt catatggtaa cagctcccca cctctgaaca aaatgaacag catgaacaag 1140 
ctgccttctg tgagccagct tatcaaccct cagcagcgca acgccctcac tcctacaacc 1200 
attcctgatg gcatgggagc caacattccc atgatgggca cccacatgcc aatggctgga 1260 
gacatgaatg gactcagccc cacccaggca ctccctcccc cactctccat gccatccacc 1320 
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tcccactgca cacccccacc tccgtatccc acagattgca gcattgtcag gatctggcaa 1380 
gtctga ^ 1386 

<210> 337 
<211> 1551 
<212> DNA 

<213> Homo sapiens 
<400> 337 

atgtcccaga gcacacagac aaatgaattc ctcagtccag aggttttcca gcatatctgg 60 
gattttctgg aacagcctat atgttcagtt cagcccattg acttgaactt tgtggatgaa 120 
ccatcagaag atggtgcgac aaacaagatt gagattagca tggactgtat ccgcatgcag 180 
gactcggacc tgagtgaccc catgtggcca cagtacacga acctggggct cctgaacagc 240 
atggaccagc agattcagaa cggctcctcg tccaccagtc cctataacac agaccacgcg 300 
cagaacagcg tcacggcgcc ctcgccctac gcacagccca gctccacctt cgatgctctc 360 
tctccatcac ccgccatccc ctccaacacc gactacccag gcccgcacag tttcgacgtg 420 
tccttccagc agtcgagcac cgccaagtcg gccacctgga cgtattccac tgaactgaag 480 
aaactctact gccaaattgc aaagacatgc cccatccaga tcaaggtgat gaccccacct 540 
cctcagggag ctgttatccg cgccatgcct gtctacaaaa aagctgagca cgtcacggag 600 
gtggtgaagc ggtgccccaa ccatgagctg agccgtgaat tcaacgaggg, acagattgcc 660 
cctcctagtc atttgattcg agtagagggg aacagccatg cccagtatgt agaagatcec 720 
atcacaggaa gacagagtgt gctggtacct tatgagccac cccaggttgg cactgaattc 780 
acgacagtct tgtacaattt catgtgtaac agcagttgtg ttggagggat gaaccgccgt 840 
ccaattttaa tcattgttac tctggaaacc agagatgggc aagtcctggg ccgacgctgc 900 
tttgaggccc ggatctgtgc ttgcccagga agagacagga aggcggatga agatagcatc 960 
agaaagcagc aagtttcgga cagtacaaag aacggtgatg gtacgaagcg cccgtttcgt 1020 
cagaacacac atggtatcca gatgacatcc atcaagaaac gaagatcccc agatgatgaa 1080 
ctgttatact taccagtgag gggccgtgag acttatgaaa tgctgttgaa gatcaaagag 1140 
tccctggaac tcatgcagta ccttcctcag cacacaattg aaacgtacag gcaacagcaa 1200 
cagcagcagc accagcactt acttcagaaa cagacctcaa tacagtctcc atcttcatat 1260 
ggtaacagct ccccacctct gaacaaaatg aacagcatga acaagctgcc ttctgtgagc 1320 
cagcttatca accctcagca gcgcaacgcc ctcactccta caaccattcc tgatggcatg 1380 
.ggagccaaca ttcccatgat gggcacccac atgccaatgg ctggagacat gaatggactc 1440 
agccccaccc aggcactccc tcccccactc tccatgccat ccacctccca ctgcacaccc 1500 
ccacctccgt atcccacaga ttgcagcatt gtcaggatcr, ggcaagtctg a 1551 

<210> 338 . 
<211> 586 
<212> PRT 

<213> Homo sapiens 
<400> 338 

Met Leu Tyr Leu Glu Asn Asn Ala Gin Thr Gin Phe Ser Glu Pro Gin 

1 5 io is 

Tyr Thr Asn Leu Gly Leu Leu Asn Ser Met Asp Gin Gin He Arg Asn 

20 25 30 

Gly Ser Ser Ser Thr Ser Pro Tyr Asn Thr Asp His Ala Gin Asn Ser 

35 40 45 

Val Thr Ala Pro Ser Pro Tyr Ala Gin Pro Ser Pro Thr Phe Asp Ala 

50 55 60 

Leu Ser Pro Ser Pro Ala He Pro Ser Asn Thr Asp Tyr Pro ply Pro 
65 70 75 " 80 

His Ser Ser Asp Val Ser Phe Gin Gin Ser Ser Thr Ala Lys Ser Ala 

85 90 95 

Thr Trp Thr Tyr Ser Thr Glu Leu Lys Lys Leu Tyr Cys Gin He Ala 

100 , 105 no 

Lys Thr Cys Pro He Gin He Lys Val Met Thr Pro Pro Pro Gin Gly 
115 120 125 
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Ala Val 
130 
iGlu Val 
145 

Glu Gly 

Ser His 

Leu Val 

Leu Tyr 
210 
Arg Pro 
225 

Leu Gly 

Asp Arg 

Ser Thr 

His Gly 
290 
.Glu Leu 
305 

Leu Lys 

Thr He 

Leu Gin 

Ser Pro 
370 
Ser Gin 
385 

He Pro 

Pro Met 

Pro Pro 

Tyr Pro 
450 
Ser Ser 
465 

Gin He 

Glu Gin 

Leu His 

Ala Ser 
530 
He Asp 
545 

Arg Asp 



He Arg Ala Met 

Val Lys Arg Cys 
150 

Gin He Ala Pro 
165 

Ala Gin Tyr Val 
180 

Pro Tyr Glu Pro 
195 

Asn Phe Met Cys 

He Leu He lie 
230 

Arg Arg Cys Phe 
245 

Lys Ala Asp Glu 
260 

Lys Asn Gly Asp 
275 

He Gin Met Thr 

Leu Tyr Leu Pro 
310 

He Lys Glu Ser 
325 

Glu Thr Tyr Arg 
340 

Lys Gin Thr Ser 
355 

Pro Leu Asn Lys 

Leu He Asn Pro 
390 

Asp Gly Met Gly 
405 

Ala Gly Asp Met 
420 

Leu Ser Met Pro 
435 

Thr Asp Cys Ser 

Cys Leu Asp Tyr 
470 

Glu His Tyr Ser 
485 

Phe Arg His Ala 
500 

Glu Phe Ser Ser 
515 

Thr Val Ser Val 



Ala Val Arg Phe 
550 

Glu Trp Asn Asp 
565 

Lys Gin Gin Arg He Lys 
580 



Pro Val Tyr Lys Lys 
135 

Pro Asn His Glu Leu 
155 

Pro Ser His Leu He 
170 

Glu Asp Pro He Thr 
185 

Pro Gin Val Gly Thr 
200 

Asn Ser Ser Cys Val 
215 

Val Thr Leu Glu Thr 
235 

Glu Ala Arg He Cys 
250 

Asp Ser lie Arg Lys 
265 

Gly Thr Lys Arg Pro 
280 

Ser He Lys Lys Arg 
295 

Val Arg Gly Arg Glu 
315 

Leu Glu Leu Met Gin 
330 

Gin Gin Gin Gin Gin 
345 

He Gin Ser Pro Ser 
360 

Met Asn Ser Met Asn 
375 

Gin Gin Arg Asn Ala 
395 

Ala Asn lie Pro Me^ 
410 

Asn Gly Leu Ser Pro 
425 

Ser Thr Ser His Cys 
440 

He Val Ser Phe Leu 
455 

Phe Thr Thr Gin Gly 
475 

Met Asp Asp Leu Ala 
490 

He Trp Lys Gly He 
505 

Pro Ser His Leu Leu 
520 

Gly Ser Ser Glu Thr 
535 

Thr Leu Arg Gin Thr 
555 

Phe Asn Phe Asp Met 
570 

Glu Glu Gly Glu 
585 



Ala Glu His Val Thr 
140 

Ser Arg Glu Phe Asn 
160 

Arg Val Glu Gly Asn 
175 

Gly Arg Gin Ser Val 
190 

Glu Phe Thr Thr Val 
205 

Gly Gly Met Asn Arg 
220 

Arg Asp Gly Gin Val 
240 

Ala Cys Pro Gly Arg 
255 

Gin Gin Val Ser Asp 
270 

Phe Arg Gin Asn Thr 
285 

Arg Ser Pro Asp Asp 
300 

Thr Tyr Glu Met Leu 
320 

Tyr Leu Pro Gin His 
335 

Gin His Gin His Leu 
350 

Ser Tyr Gly Asn Ser 
365 

Lys Leu Pro Ser Val 
380 

Leu Thr Pro Thr Thr 
400 

Met Gly Thr His Met 
415 

Thr Gin Ala Leu Pro 
430 

Thr Pro Pro Pro Pro 
445 

Ala Arg Leu Gly Cys 
460 

Leu Thr Thr He Tyr 
480 

Ser Leu Lys He Pro 
495 

Leu Asp His Arg Gin 
510 

Arg Thr Pro Ser Ser 
525 

Arg Gly Glu ? Arg Val 
540 

He Ser Phe Pro Pro 
560 

Asp Ala Arg Arg Asn 
575 
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<210> 339 
"<211> 641 
<212> PRT 

<213> Homo sapiens 
<400> 339 

Met Ser Gin Ser Thr Gin Thr Asn Glu Phe Leu Ser Pro Glu Val Phe 

1 5 io 15 

Gin His He Trp Asp Phe Leu Glu Gin Pro He Cys Ser Val Gin Pro 

20 25 30 

He Asp Leu Asn Phe Val Asp Glu Pro Ser Glu Asp Gly Ala Thr Asn 

35 40 45 

Lys He Glu He Ser Met Asp Cys He Arg Met Gin Asp Ser Asp Leu 

50 55 60 

Ser Asp Pro Met Trp Pro Gin Tyr Thr Asn Leu Gly Leu Leu Asn Ser 
65 70 75 " 80 

Met Asp Gin Gin He Gin Asn Gly Ser Ser Ser Thr Ser Pro Tyr Asn 

85 90 95 

Thr Asp His Ala Gin Asn Ser Val Thr Ala Pro Ser Pro Tyr Ala Gin 

100 105 no 

Pro Ser Ser Thr Phe Asp Ala Leu Ser Pro Ser Pro Ala He Pro Ser 

115 120 125 

Asn Thr Asp Tyr Pro Gly Pro His Ser Phe Asp Val Ser Phe Gin Gin 

130 135 140 

Ser Ser Thr Ala Lys Ser Ala Thr Trp Thr Tyr Ser Thr Glu Leu Lys 
145 150 155 160 

Lys Leu Tyr Cys Gin He Ala Lys Thr Cys Pro He Gin He Lys Val 

165 170 175 

Met Thr Pro Pro Pro Gin Gly Ala Val He Arg Ala Met Pro Val Tyr 

180 185 190 

Lys Lys Ala Glu His Val Thr Glu Val Val Lys Arg Cys Pro Asn His 

195 200 205 

Glu Leu Ser Arg Glu Phe Asn Glu Gly Gin He Ala Pro Pro Ser His 

210 215 < 220 

Leu He Arg Val Glu Gly Asn Ser His Ala Gin Tyr Val Glu Asp Pro 
225 230 235 240 

He Thr Gly Arg Gin Ser Val Leu Val Pro Tyr Glu Pro Pro Gin Val 

245 250 255 

Gly Thr Glu Phe Thr Thr Val Leu Tyr Asn Phe Met Cys Asn Ser Ser 

260 265 270 

Cys Val Gly Gly Met ^ Asn Arg Arg Pro He Leu He He Val Thr Leu 

275 280 285 

Glu Thr Arg Asp Gly Gin Val Leu Gly Arg Arg Cys Phe Glu Ala Ara 

29 0 295 300 

He Cys Ala Cys Pro Gly Arg Asp Arg Lys Ala Asp Glu Asp Ser He 
305 310 315 320 

Arg Lys Gin Gin Val Ser Asp Ser Thr Lys Asn Gly Asp Gly Thr Lys 

325 330 335 

Arg Pro Phe Arg Gin Asn Thr His Gly He Gin Met Thr Ser He Lys 

340 345 350 i 

Lys Arg Arg Ser Pro Asp Asp Glu Leu Leu Tyr Leu Pro Val Arg Gly 

355 3 6 o 365 

Arg Glu Thr Tyr Glu Met Leu Leu Lys He Lys Glu Ser Leu Glu Leu 

37 0 375 380 

Met Gin Tyr Leu Pro Gin His Thr He Glu Thr Tyr Arg Gin Gin Gin 
385 390 395 400 
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Gin Gin 

^Pro Ser 

Met Asn 

Asn Ala 
450 
Pro Met 
465 

Ser Pro 

His Cys 

Phe Leu 

Gin Gly 
530 
Leu Ala 
545 

Gly He 

Leu Leu 

Glu Thr 

Gin Thr 
610 
Asp Met 
625 
Glu 



Gin His Gin 
405 

Ser Tyr Gly 

420 
Lys Leu Pro 
435 

Leu Thr Pro 

Met Gly Thr 

Thr Gin Ala 
485 

Thr Pro Pro 

500 
Ala Arg Leu 
515 

Leu Thr Thr 

Ser Leu Lys 

Leu Asp His 
565 

Arg Thr Pro 

580 
Arg Gly Glu 
595 

lie Ser Phe 
Asp Ala Arg 



His Leu Leu Gin 
Asn Ser 
Ser Val 



Thr Thr 
455 
His Met 
470 

Leu Pro 

Pro Pro 

Gly Cys 

He Tyr 
535 
He Pro 
550 

Arg Gin 

Ser Ser 

Arg Val 

Pro Pro 
615 
Arg Asn 
630 



Ser Pro 
425 
Ser Gin 
440 

He Pro 



Pro Met 

Pro Pro 

Tyr Pro 
505 
Ser Ser 
520 

Gin He 

Glu Gin 

Leu His 

Ala Ser 
585 
He Asp 
600 

Arg Asp 
Lys Gin 



Lys Gin 
410 

Pro Leu 

Leu He 

Asp Gly 

Ala Gly 
475 
Leu Ser 
490 

Thr Asp 

Cys Leu 

Glu His 

Phe Arg 
555 
Glu Phe 
570 

Thr Val 

Ala Val 

Glu Trp 

Gin Arg 
635 



Thr Ser 

Asn Lys 

Asn Pro 
445 
Met Gly 
460 

Asp Met 

Met Pro 

Cys Ser 

Asp Tyr 
525 
Tyr Ser 
540 

His Ala 

Ser Ser 

Ser Val 

Arg Phe 
605 
Asn Asp 
620 

He Lys 



He Gin Ser 

415 
Met Asn Ser 
430 

Gin Gin Arg 

Ala Asn He 

Asn Gly Leu 
480 

Ser Thr Ser 

495 
He Val Gly 
510 

Phe Thr Thr 

Met Asp Asp 

He Trp Lys 
560 

Pro Ser His 

575 
Gly Ser Ser 
590 

Thr Leu Arg 

Phe Asn Phe 

Glu Glu Gly 
640 



<210> 340 
<211> 448 
<212> PRT 
<213> Homo sapiens 



<400> 340 



Met 


Ser 


Gin 


Ser 


Thr Gin Thr Asn 


1 








5 


Gin 


His 


He 


Trp 


Asp Phe Leu Glu 








20 




He 


Asp 


Leu 


Asn 


Phe Val Asp Glu 






35 




40 


Lys 


He 


Glu 


He 


Ser Met Asp Cys 




50 






. 55 


Ser 


Asp 


Pro 


Met 


Trp Pro Gin Tyr 


65 








70 


Met 


Asp 


Gin 


Gin 


He Gin Asn Gly 










85 


Thr 


Asp 


His 


Ala 


Gin Asn Ser Val 








100 




Pro 


Ser 


Ser 


Thr 


Phe Asp Ala Leu 






115 




120 


Asn 


Thr 


Asp 


Tyr 


Pro Gly Pro His 




130 






135 



Glu Phe Leu Ser Pro Glu Val Phe 

10 15 
Gin Pro He Cys Ser Val Gin Pro 
25 30 
Pro Ser Glu Asp Gly Ala Thr Asn 
45 

He Arg Met Gin Asp Ser Asp Leu 
60 

Thr Asn Leu Gly Leu Leu Asn Ser 

75 80 
Ser Ser Ser Thr Ser Pro Tyr Asn 

90 ! 95 
Thr Ala Pro Ser Pro Tyr Ala Gin 
105 110 
Ser Pro S.er Pro Ala He Pro Ser 
125 

Ser Phe Asp Val Ser Phe Gin Gin 
140 
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Ser Ser Thr Ala Lys Ser Ala Thr Trp Thr Tyr Ser Thr Glu Leu Lys 
145 150 155 160 

^Lys Leu Tyr Cys Gin lie Ala Lys Thr Cys Pro He Gin He Lys Val 

165 170 175 

Met Thr Pro Pro Pro Gin Gly Ala Val He Arg Ala Met Pro Val Tyr 

180 185 190 

Lys Lys Ala Glu His Val Thr Glu Val Val Lys Arg Cys Pro Asn His 

195 200 205 

Glu Leu Ser Arg Glu Phe Asn Glu Gly Gin He Ala Pro Pro Ser His 

210 215 220 

Leu He Arg Val Glu Gly Asn Ser His Ala Gin Tyr Val Glu Asp Pro 
225 230 235 240 

He Thr Gly Arg Gin Ser Val Leu Val Pro Tyr Glu Pro Pro Gin Val 

245 250 255 

Gly Thr Glu Phe Thr Thr Val Leu Tyr Asn Phe Met Cys Asn Ser Ser 

260 265 270 

Cys Val Gly Gly Met Asn Arg Arg Pro He Leu He He Val Thr Leu 

275 280 285 

Glu Thr Arg Asp Gly Gin Val Leu Gly Arg Arg Cys Phe Glu Ala Arg 

290 295 300 

He Cys Ala Cys Pro Gly Arg Asp Arg Lys Ala Asp Glu Asp Ser He 
305 310 , 315 k 320 

Arg Lys Gin Gin Val Ser Asp Ser Thr Lys Asn Gly Asp Gly Thr Lys 

325 330 " 335 

Arg Pro Phe Arg Gin Asn Thr His Gly He Gin Met Thr Ser He Lys 

340 345 350 

Lys Arg Arg Ser Pro Asp Asp Glu Leu Leu Tyr Leu Pro Val Arg Gly 

355 360 365 

Arg Glu Thr Tyr Glu Met Leu Leu Lys He Lys Glu Ser Leu Glu Leu 

370 375 380 

Met Gin Tyr Leu Pro Gin His Thr He Glu Thr Tyr Arg Gin Gin Gin 
385 390 395 400 

Gin Gin Gin His Gin His Leu Leu Gin Lys His Leu Leu Ser Ala Cys 

405 410 415 

Phe Arg Asn Glu Leu Val Glu Pro Arg Arg Glu Thr Pro Lys Gin Ser 

420 425 ' 430 

Asp Val Phe Phe Arg His Ser Lys Pro Pro Asn Arg Ser Val Tyr Pro 
435 440 445 



<210> 341 
<211> 356 
<212> PRT 

<213> Homo sapiens 
<400> 341 

Met Leu Tyr Leu Glu Asn Asn Ala Gin Thr Gin Phe Ser Glu Pro Gin 

15 10 15 

Tyr Thr Asn Leu Gly Leu Leu Asn Ser Met Asp Gin Gin He Gin Asn 

20 25 30 

Gly Ser Ser Ser Thr Ser Pro Tyr Asn Thr Asp His Ala Gin Asn Ser 

35 40 45 

Val Thr Ala Pro Ser Pro Tyr Ala Gin Pro Ser Ser Thr Phe Asp Ala 

50 55 60 

Leu Ser Pro Ser Pro Ala He Pro Ser Asn Thr Asp Tyr Pro Gly Pro 
65 70 75 80 

His Ser Phe Asp Val Ser Phe Gin Gin Ser Ser Thr Ala Lys Ser Ala 
85 90 95 
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Thr 


Trp 


Thr 


Tyr 


Ser Thr Glu Leu Lys 








100 


105 


^Lys 


Thr 


Cys 


Pro 


He Gin He Lys Val 






115 




120 


Ala 


Val 


lie 


Arg 


Ala Met Pro Val Tyr 




130 






135 


Glu 


Val 


Val 




Arg Cys Pro Asn His 


145 








150 


Glu 


Gly 


Gin 


He 


Ala Pro Pro Ser His 










165 


Ser 


Hxs 


Ala 


Gin 


Tyr Val Glu Asp Pro 








180 


185 


Leu 


Val 


Pro 


Tyr 


Glu Pro Pro Gin Val 






195 




200 


Leu 


Tyr 


Asn 


Phe 


Met Cys Asn Ser Ser 




210 






215 


Arg 


Pro 


lie 


Leu 


He He Val Thr Leu 


225 








230 


Leu 


Gly 


Arg 


Arg 


Cys Phe Glu Ala Arg 










245 


Asp 


Arg 


Lys 


Ala 


Asp Glu Asp Ser He 








260 


265 


Ser 


Thr 


Lys 


Asn 


Gly Asp Gly Thr Lys 






275 




280 


His 


Gly 


He 


Gin 


Met Thr Ser He Lys 




290 






295 


Glu 


Leu 


Leu 


Tyr 


Leu Pro Val Arg Gly 


305 








310 


Leu 


Lys 


lie 


Lys 


Glu Ser Leu Glu Leu 










325 


Thr 


lie 


Glu 


Thr 


Tyr Arg Gin Gin Gin 








340 


345 


Leu 


Gin 


Lys 


Gin 








355 







<210> 342 
<211> 680 
<212> PRT 

<213> Homo sapiens 



<400> 342 



Met 


Asn 


Phe 


Glu 


Thr Ser Arg Cys Ala 


1 








5 


Pro 


Tyr 


He 


Gin 


Arg Phe Val Glu Thr 








20 


25 


Glu 


Ser 


Tyr 


Tyr 


Arg Ser Thr Met Ser 






35 




40 


Phe 


Leu 


Ser 


Pro 


Glu Val Phe Gin His 




50 






55 


Pro 


He 


Cys 


Ser 


Val Gin Pro He Asp 


65 








70 


Ser 


Glu 


Asp 


Gly 


Ala Thr' Asn Lys He 










85 


Arg 


Met 


Gin 


Asp 


Ser Asp Leu Ser Asp 








100 


105 


Asn 


Leu 


Gly 


Leu 


Leu Asn Ser Met Asp 






115 




120 



PCT/US01/21065 

148 



Lys 


Leu 


Tyr Cys 


Gin He Ala 








110 


Met 


Thr 


Pro Pro 


Pro Gin Gly 






125 




j->_y o 




Ala Glu 


His Val Thr 






140 




Gl ii 

V3J.U 




Ser Arg Glu Phe Asn 




155 




160 






Arg Val Glu Gly Asn 


170 






175 


Tl #» 


Thr 


Gly Arg Gin Ser Val 








190 


(11 v 


Thr 

X11X 


Glu Phe 


Thr Thr Val 






205 




v-ys 


vax 


Gly Gly Met Asn Arg 






220 




ulU 


mr 


Arg Asp Gly Gin Val 








240 


i j.e 


Cys 


Ala Cys 


Pro Gly Arg 








255 


Arg 


Lys 


Gin Gin Val Ser Asp 








270 


Arg 


Pro 


Ser Arg Gin Asn Thr 






285 




Lys 


Arg 


Arg Ser Pro Asp Asp 






300 




Arg 


Glu 


Thr Tyr Glu Met Leu 




315 




320 


Met 


Gin 


Tyr Leu 


Pro Gin His 


330 






335 


Gin 


Gin 


Gin His 


Gin His Leu 




i 




350 


1 ILL 


Leu 


Gin Tyr 


Cys Pro Asp 


1 0 
-LI/ 






15 


IrXO 


nl d 


His Phe 


Ser Trp Lys 








30 


Gin 


Ser 


Thr Gin 


Thr Asn Glu 






45 




He 


Trp 


Asp Phe Leu Glu Gin 






60 




Leu 


Asn 


Phe Val Asp Glu Pro 




75 




80 


Glu 


lie 


Ser Met Asp Cys He 


90 






95 


Pro 


Met 


Trp Pro Gin Tyr Thr 








110 


Gin 


Gin 


He Gin Asn Gly Ser 

1 



125 
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Ser Ser Thr Ser 
130 

"Ala. Pro Ser Pro 
145 

Pro Ser Pro Ala 

Phe Asp Val Ser 
180 

Thr Tyr Ser Thr 
195 

Cys Pro lie Gin 
210 

He Arg Ala Met 
225 

Val Lys Arg Cys 

Gin lie Ala Pro 
260 

Ala Gin Tyr Val 
275 

Pro Tyr Glu Pro 
290 

Asn Phe Met Cys 
305 

He Leu He lie 

Arg Arg Cys Phe 
340 

Lys Ala Asp Glu 
355 

Lys Asn Gly Asp 
370 

lie Gin Met Thr 
385 

Leu Tyr Leu Pro 

He Lys Glu Ser 
420 

Glu Thr Tyr Arg 
435 

Lys Gin Thr Ser 
450 

Pro Leu Asn Lys 
465 

Leu He Asn Pro 



Pro Tyr Asn 
135 

Tyr Ala Gin 

150 
He Pro Ser 
165 

Phe Gin Gin 

Glu Leu Lys 

He Lys Val 
215 

Pro Val Tyr 

230 
Pro Asn His 
245 

Pro Ser His 



Thr Asp His Ala 
Pro Ser 
Asn Thr 



Ser Ser 
185 
Lys Leu 
200 

Met Thr 
Lys Lys 
Glu Leu 



Ser Thr 
155 
Asp Tyr 
170 

Thr Ala 



Leu He 
265 

Glu Asp Pro He Thr 
280 

Gly Thr 



Pro Gin Val 
295 

Asn Ser Ser 

310 
Val Thr Leu 
325 

Glu Ala Arg 



Cys Val 
Glu Thr 



He Cys 
345 

Asp Ser He Arg Lys 
360 

Arg Pro 



Gly Thr Lys 
375 

Ser He Lys 

390 
Val Arg Gly 
405 

Leu Glu Leu 



Lys Arg 
Arg Glu 



Met Gin 
425 

Gin Gin Gin Gin Gin 
440 

Pro Ser 



He Gin Ser 
455 

Met Asn Ser 

470 
Gin Gin Arg 
485 

Ala Ash He 



Asp Gly Met Gly 
500 

Ala Gly Asp Met Asn Gly Leu 
515 

Leu Ser Met Pro 
530 

Thr Asp Cys Ser 
545 

Cys Leu Asp Tyr 



Glu His Tyr Ser 
580 



Ser Thr Ser 
535 

He Val Ser 

550 
Phe Thr Thr 
565 

Met Asp Asp 



Met Asn 

Asn Ala 

Pro Met 
505 
Ser Pro 
520 

Gin Cys 

Phe Leu 

Gin Gly 

Leu Ala 
585 



Tyr Cys 

Pro Pro 

Ala Glu 
235 
Ser Arg 
250 

Arg Val 

Gly Arg 

Glu Phe 

Gly Gly 
315 
Arg Asp 
330 

Ala Cys 

Gin Gin 

Phe Arg 

Arg Ser 
395 
Thr Tyr 
410 

Tyr Leu 

Gin His 

Ser Tyr 

Lys Leu 
475 
Leu Thr 
490 

Met Gly 
Thr Gin 
Thr Pro 



Gin Asn 
140 

Phe Asp 

Pro Gly 

Lys Ser 

Gin He 
205 
Pro Gin 
220 

His Val 

Glu Phe 

Glu Gly 

Gin Ser 
285 
Thr Thr 
300 

Met Asn 

Gly Gin 

Pro Gly 

Val Ser 
365 
Gin Asn 
380 

Pro Asp 

Glu Met 

Pro Gin 

Gin His 
445 
Gly Asn 
460 

Pro Ser 



Ser Val Thr 

Ala Leu Ser 
160 

Pro His Ser 
175 

Ala Thr Trp 
190 

Ala Lys Thr 

Gly Ala Val 

Thr Glu Val 
240 

Asn Glu Gly 

255 
Asn Ser His 
270 

Val Leu Val 

Val Leu Tyr 

Arg Arg Pro 
320 

Val Leu Gly 

335 
Arg Asp Arg 
350 

Asp Ser Thr 

Thr His Gly 

Asp Glu Leu 
. 400 
Leu Leu Lys 

415 
His Thr He 
430 

Leu Leu Gin 
Ser Ser Pro 



Ala Arg 
555 
Leu Thr 
570 

Ser Leu 



Pro Thr 

Thr His 

Ala Leu 
525 
Pro Pro 
540 

Leu Gly 



Thr He 
Lys He 



Val Ser Gin 
480 

Thr He Pro 

495 
Met Pro Met 
510 

Pro Pro Pro 

Pro ,Tyr Pro 

Cys Ser Ser 
560 

Tyr Gin lie 

575 
Pro Glu Gin 
590 
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Phe Arg His Ala He Trp Lys Gly He Leu Asp His Arg Gin Leu His 

595 600 605 

Glu Phe Ser Ser Pro Ser His Leu Leu Arg Thr Pro Ser Ser Ala Ser 

610 615 620 

Thr Val Ser Val Gly Ser Ser Glu Thr Arg Gly Glu Arg Val He Asp 
625 630 635 640 

Ala Val Arg Phe Thr Leu Arg Gin Thr He Ser Phe Pro Pro Arg Asp 

645 650 655 

Glu Trp Asn Asp Phe Asn Phe Asp Met Asp Ala Arg Arg Asn Lys Gin 

660 665 670 

Gin Arg He Lys Glu Glu Gly Glu 
675 680 



<210> 343 
<211> 461 
<212> PRT 
<213> Homo sapiens 

<400> 343 

Met Leu Tyr Leu Glu Asn Asn Ala Gin Thr Gin Phe Ser Glu Pro Gin 

15 10 15 

Tyr Thr Asn Leu Gly Leu Leu Asn Ser Met Asp Gin Gin He Gin Asn 

20 25 30 

Gly Ser Ser Ser Thr Ser Pro Tyr Asn Thr Asp His Ala Gin Asn Ser 

35 40 45 

Val Thr Ala Pro Ser Pro Tyr Ala Gin Pro Ser Ser Thr Phe Asp Ala 

50 55 60 

Leu Ser Pro Ser Pro Ala He Pro Ser Asn Thr Asp Tyr Pro Gly Pro 
65 70 75 * 80 

His Ser Phe Asp Val Ser Phe Gin Gin Ser Ser Thr Ala Lys Ser Ala 

85 90 95 

Thr Trp Thr Tyr Ser Thr Glu Leu Lys Lys Leu Tyr Cys Gin He Ala 

100 105 110 

Lys Thr Cys Pro He Gin He Lys Val Met Thr Pro Pro Pro Gin Gly 

115 120 125 

Ala Val He Arg Ala Met Pro Val Tyr Lys Lys Ala Glu His Val Thr 

130 135 140 

Glu Val Val Lys Arg Cys Pro Asn His Glu Leu Ser Arg Glu Phe Asn 
145 150 155 160 

Glu Gly Gin He Ala Pro Pro Ser His Leu He Arg Val Glu Gly Asn 

165 170 " 175 

Ser His Ala Gin Tyr Val Glu Asp Pro He Thr Gly Arg Gin Ser Val 

180 185 190 

Leu Val Pro Tyr Glu Pro Pro Gin Val Gly Thr Glu Phe Thr Thr Val 

195 200 205 

Leu Tyr Asn Phe Met Cys Asn Ser Ser Cys Val Gly Gly Met Asn Arg 

210 215 220 

Arg Pro He Leu He He Val Thr Leu Glu Thr Arg Asp Gly Gin Val 
225 230 235 240 

Leu Gly Arg Arg Cys Phe Glu Ala Arg He Cys Ala Cys Pro Gly Arg 

245 250 '255 

Asp Arg Lys Ala Asp Glu Asp Ser He Arg Lys Gin Gin Val Ser Asp 

260 265 270 

Ser Thr Lys Asn Gly Asp Gly Thr Lys Arg Pro Phe Arg Gin Asn Thr 

275 280 285 

His Gly He Gin Met Thr Ser He Lys Lys Arg Arg Ser Pro Asp Asp 
290 295 300 
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Glu Leu Leu (Tyr Leu Pro Val Arg Gly Arg Glu Thr Tyr Glu Met Leu 
305 310 315 ~ 320 

^Leu Lys lie Lys Glu Ser Leu Glu Leu Met Gin Tyr Leu Pro Gin His 

325 330 335 

Thr He Glu Thr Tyr Arg Gin Gin Gin Gin Gin Gin His Gin His Leu 

340 345 350 

Leu Gin Lys Gin Thr Ser He Gin Ser Pro Ser Ser Tyr Gly Asn Ser 

355 360 365 

Ser Pro Pro Leu Asn Lys Met Asn Ser Met Asn Lys Leu Pro Ser Val 

370 375 380 

Ser Gin Leu He Asn Pro Gin Gin Arg Asn Ala Leu Thr Pro Thr Thr 
385 390 395 400 

He Pro Asp Gly Met Gly Ala Asn He Pro Met Met Gly Thr His Met 

405 410 ^ 415 

Pro Met Ala Gly Asp Met Asn Gly Leu Ser Pro Thr Gin Ala Leu Pro 

420 425 430 

Pro Pro Leu Ser Met Pro Ser Thr Ser His Cys Thr Pro Pro Pro Pro 

435 440 445 

Tyr Pro Thr Asp Cys Ser He Val Arg He Trp Gin Val 
450 455 460 



<210> 344 
<211> 516 
<212> PRT 
<213> Homo sapiens 

<400> 344 

Met Ser Gin Ser Thr Gin Thr Asn Glu Phe Leu Ser Pro Glu Val Phe 

1 5 10 15 

Gin His He Trp Asp Phe Leu Glu Gin Pro He Cys Ser Val Gin Pro 

20 25 30 

He Asp Leu Asn Phe Val Asp Glu Pro Ser Glu Asp Gly Ala Thr Asn 

35 40 45 

Lys He Glu He Ser Met Asp Cys He Arg Met Gin Asp Ser Asp Leu 

50 55 1 60 

Ser Asp Pro Met Trp Pro Gin Tyr Thr Asn Leu Gly Leu Leu Asn Ser 
65 70 75 ~ 80 

Met Asp Gin Gin He Gin Asn Gly Ser Ser Ser Thr Ser Pro Tyr Asn 

85 90 95 

Thr Asp His Ala Gin Asn Ser Val Thr Ala Pro Ser Pro Tyr Ala Gin 

100 105 HO 

Pro Ser Ser Thr Phe Asp Ala Leu Ser Pro Ser Pro Ala He Pro Ser 

115 120 125 

Asn Thr Asp Tyr Pro Gly Pro His Ser Phe Asp Val Ser Phe Gin Gin 

130 135 J40 

Ser Ser Thr Ala Lys Ser Ala Thr Trp Thr Tyr Ser Thr Glu Leu Lys 
145 150 155 160 

Lys Leu Tyr Cys Gin He Ala Lys Thr Cys Pro He Gin He Lys Val 

165 170 175 

Met Thr Pro Pro Pro Gin Gly Ala Val He Arg Ala Met Pro Val Tyr 

180 185 190 ! 

Lys Lys Ala Glu His Val Thr Glu Val Val Lys Arg Cys Pro Asn His 

195 200 205 

Glu Leu Ser Arg Glu Phe Asn Glu Gly Gin He Ala Pro Pro Ser His 

210 215 220 

Leu He Arg Val Glu Gly Asn Ser «is Ala Gin Tyr Val Glu Asp Pro 
225 230 235 240 
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lie 


Thr 


ciiy 


/ 

7\ — _ 

Arg 


bin oer vax i>eu 






OA 


•*Gly 


Thr 


b±U 


rne 


iiix i iii- vax xieu 








zoU 




Cys 


vai 


biy 


biy 


Mcu Ho ii Arg Arg 






one 




zoU 


Glu 


Thr 


Arg 


Asp 


biy bxn vai Leu 




290 








lie 


Cys 


Ala 


Cys 


Fro bxy Arg Asp 


•O AC 

305 










Arg 


i»y s 


Gin 


Gin 


Val Ser Asp Ser 










325 


Arg 


Pro 


Phe 


Arg 


Gin Asn rnr his 








340 




Lys 


Arg 


Arg 


Ser 


Pro Asp Asp Glu 






355 




n c r\ 

360 


Arg 


Glu 


Thr 


Tyr 


Glu Met Leu Leu 




370 






375 


Met 


Gin 


Tyr 


Leu 


Pro Gin His Thr 


385 








390 


Gin 


Gin 


Gin 


His 


Gin His Leu Leu 










405 


Pro 


Ser 


Ser 


Tyr 


Gly Asn Ser Ser 








420 




Met 


Asn 


Lys 


Leu 


-Pro Ser Val Ser 






435 




A A A 

440 


Asn 


Ala 


Leu 


Thr 


Pro Thr Thr lie 




450 






455 


Pro 


Met 


Met 


Gly 


Thr His Met Pro 


465 








470 


Ser 


Pro 


Thr 


Gin 


Ala Leu Pro Pro 










485 


His 


Cys 


Thr 


Pro 


Pro Pro Pro Tyr 








500 




He 


Trp 


Gin 


Val 








515 







Val Pro 


Tyr 


Glu Pro Pro Gin Val 


250 




255 


Tyr Asn 


Phe 


Met Cys Asn Ser Ser 


265 




270 


Pro lie 


Leu 


He He Val Thr Leu 






285 


Gly Arg 


Arg 


Cys. Phe Glu Ala Arg 






300 


Arg Lys 


Ala 


Asp Glu Asp Ser He 




315 


320 


Thr Lys 


Asn 


Gly Asp Gly Thr Lys 


330 




335 


Gly lie 


Gin 


Met Thr Ser He Lys 


345 




350 


Leu Leu 


Tyr 


Leu Pro Val Arg Gly 






365 


Lys He 


Lys 


Glu Ser Leu Glu Leu 






380 


He Glu 


Thr 


Tyr Arg Gin Gin Gin 




395 


400 


Gin Lys 


Gin 


Thr Ser He Gin Ser 


410 




415 


Pro Pro 


Leu 


Asn Lys Met Asn Ser 


425 




430 


Gin Leu 


He 


Asn Pro Gin Gin Arg 






445 


Pro Asp 


Gly 


Met Gly Ala Asn He 






4 60 


Met Ala 


Gly 


Asp Met Asn Gly Leu 




475 


480 


Pro Leu 


Ser 


Met Pro Ser Thr Ser 


490 




495 


Pro Thr 


Asp 


Cys Ser He Val Arg 


505 




510 



<210> 345 

<211> 1800 

<212> DNA 

<213> Homo sapiens 

<400> 345 

gcgcctcatt gccactgcag 
actggttgtt ttttaaacaa 
tgacattcgt atcatcactg 
ggaggtctga aaccctcgca 
agtcgttgga aacaggactc 
tttcatcggg ggtgtcaaca 
ctttattttc cgagtcatga 
agaggacttc gtctgeaaca 
tttcccggtg tcccacatcc 
gctgctggtg gecatgeatg 
aggagagaag aggaatgatt 
agaggggtcg ctgtggtgga 
agectttatg tatgtgtttt 
atgtgggatt gacccctgcc 



tgactaaagc tgggaagacg 
attctgatac aggegacate 
tgcaccattg gcttctaggc 
gagggatctt gccctcattc 
agggataaac cagegcaatg 
aacactccac cagcateggg 
tcctagtggt ggctgcccag 
cactgcaacc gggatgcaaa 
ggctgtgggc cctccagctg 
tggectacta caggcacgaa 
tcaaagacat agaggacatt 
cgtacaccag cagcatcttt 
acttccttta caatgggtac 
ccaaccttgt tgactgcttt 



ctggtcagtt cacctgcccc 60 
ctcactgacc gagcaaagat 120 
actccagtgg ggtaggagaa 180 
tttgggtctg aaacactggc 240 
gattggggga cgctgcacac 300 
aaggtgtgga tcacagtcat 360 
gaagtgtggg gtgacgagca 420 
aatgtgtgct atgaccactt 480 
atettegtet ccaccccagc 540 
accactcgca agttcaggcg 600 
aaaaagcaca aggtteggat 660 
ttccgaatca tctttgaagc 720 
cacctgccct gggtgttgaa 780 
atttctaggc caacagagaa 840 
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gaccgtgttt accattttta tgatttctgc gtctgtgatt tgcatgctgc ttaacgtggc 900 
agagttgtgc tacctgctgc tgaaagtgtg ttttaggaga tcaaagagag cacagacgca 960 
^aaaaaatcac cccaatcatg ccctaaagga gagtaagcag aatgaaatga atgagctgat 1020 
ttcagatagt ggtcaaaatg caatcacagg tttcccaagc taaacatttc aaggtaaaat 1080 
gtagctgcgt cataaggaga cttctgtctt ctccagaagg caataccaac ctgaaagttc 1140 
cttctgtagc ctgaagagtt tgtaaatgac tttcataata aatagacact tgagttaact 1200 
ttttgtagga tacttgctcc attcatacac aacgtaatca aatatgtggt ccatctctga 1260 
aaacaagaga ctgcttgaca aaggagcatt gcagtcactt tgacaggttc cttttaagtg 1320 
gactctctga caaagtgggt actttctgaa aatttatata actgttgttg ataaggaaca 1380 
tttatccagg aattgatacg tttattagga aaagatattt ttataggctt ggatgttttt 1440 
agttccgact ttgaatttat ataaagtatt tttataatga ctggtcttcc ttacctggaa 1500 
aaacatgcga tgttagtttt agaattacac cacaagtatc taaatttcca acttacaaag 1560 
ggtcctatct tgtaaatatt gttttgeatt gtctgttggc aaatttgtga actgtcatga 1620 
tacgcttaag gtgggaaagt gttcattgca caatatattt ttactgcttt ctgaatgtag 1680 
acggaacagt gtggaagcag aaggcttttt taactcatcc gtttggccga tcgttgcaga 1740 
ccactgggag atgtggatgt ggttgcctcc ttttgctcgt ccccgtggct taacccttct 1800 

<210> 346 
<211> 261 
<212> PRT 

<213> Homo sapiens 
<400> 346 

Met Asp Trp Gly Thr Leu His Thr Phe lie Gly Gly Val Asn Lys His 

1 .5 10 15 

Ser Thr Ser lie Gly Lys Val Trp lie Thr Val lie Phe lie Phe Arg 

20 25 30 

Val Met He Leu Val Val Ala Ala Gin Glu Val Trp Gly Asp Glu Gin 

35 40 45 

Glu Asp Phe Val Cys Asn Thr Leu Gin Pro Gly Cys Lys Asn Val Cys 

50 55 60 

Tyr Asp His Phe Phe Pro Val Ser His He Arg Leu Trp Ala Leu Gin 
65 70 75 ' 80 

Leu He Phe Val Ser Thr Pro Ala Leu Leu Va} Ala Met His Val Ala 

85 .90 95 

Tyr Tyr Arg His Glu Thr Thr Arg Lys Phe Arg Arg Gly Glu Lys Arg 

100 105 HO 

Asn Asp Phe Lys Asp He Glu Asp He Lys Lys His Lys Val Arg He 

115 120 125 

Glu Gly Ser Leu Trp Trp Thr Tyr Thr Ser Ser He Phe Phe Arg He 

130 135 140 

He Phe Glu Ala Ala Phe Met Tyr Val Phe Tyr Phe Leu Tyr Asn Gly 
145 150 155 160 

Tyr His Leu Pro Trp Val Leu Lys Cys Gly He Asp Pro Cys Pro Asn 

165 170 175 

Leu Val Asp Cys Phe He Ser Arg Pro Thr Glu Lys Thr Val Phe Thr 

180 185 190 

He Phe Met He Ser Ala Ser Val He Cys Met Leu Leu Asn Val Ala 

!95 200 205 

Glu Leu Cys Tyr Leu Leu Leu Lys Val Cys Phe Arg Arg Ser Lys Arg 

210 215 220 

Ala Gin Thr Gin Lys Asn His Pro Asn His Ala Leu Lys Glu Ser Lvs 
225 230 235 2 40 

Gin Asn Glu Met Asn Glu Leu He Ser Asp Ser Gly Gin Asn Ala He 

245 - 250 255 

Thr Gly Phe Pro Ser 
260 
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n <210> 347 
<211> 1740 
<212> DNA 

<213> Homo sapiens 
<400> 347 

atgaacaaac tgtatatcgg aaacctcagc 
atcttcaagg acgccaagat cccggtgtcg 
ttcgtggact gcccggacga gagctgggcc 
atagaactgc acgggaaacc catagaagtt 
cggaaacttc agatacgaaa tatcccgcct 
ctagtccagt atggagtggt ggagagctgt 
gttgtaaatg taacctattc cagtaaggac 
ggatttcagt tagagaattt caccttgaaa 
cagcaaaacc ccttgcagca gccccgaggt 
aggcaggggt ctccaggatc cgtatccaag 
ctggttccca cccaatttgt tggagccatc 
atcaccaaac agacccagtc taaaatcgat 
gagaagtcga ttactatcct ctctactcct 
ctggagatta tgcataagga agctcaagat 
attttagctc ataataactt tgttggacgt 
aaaattgagc aagacacaga cactaaaatc 
tataatccag aacgcactat tacagttaaa 
gaggagatca tgaagaaaat cagggagtct 
caagcacatt taattcctgg attaaatctg 
gggatgccac ctcccacctc agggccccct 
gagcaatcag aaacggagac tgttcatctg 
atcggcaagc agggccagca catcaagcag 
attgctccag cggaagcacc agatgctaaa 
gaggctcagt tcaaggctca gggaagaatt 
agtcctaaag aagaggtgaa acttgaagct 
agagttattg gaaaaggagg caaaacggtg 
gttgttgtcc ctcgtgacca gacacctgat 
ggtcacttct atgcttgcca ggttgcccag 
aagcagcacc aacaacagaa ggctctgcaa 



gagaacgccg ccccctcgga cctagaaagt 60 
ggacccttcc tggtgaagac tggctacgcg 120 
ctcaaggcca tcgaggcgct ttcaggtaaa 180 
gagcactcgg tcccaaaaag gcaaaggatt 240 
catttacagt gggaggtgct ggatagttta 300 
gagcaagtga acactgactc ggaaactgca 360 
caagctagac aagcactaga caaactgaat 420 
gtagcctata tccctgatga aacggccgcc 480 
cgccgggggc ttgggcagag gggctcctca 540 
cagaaaccat gtgatttgcc tctgcgcctg 600 
ataggaaaag aaggtgccac cattcggaac 660 
gtccaccgta aagaaaatgc gggggctgct 720 
gaaggcacct ctgcggcttg taagtctatt 780 
ataaaattca cagaagagat ccccttgaag 840 
cttattggta aagaaggaag aaatcttaaa 900 
acgatatctc cattgcagga attgacgctg 960 
ggcaatgttg agacatgtgc caaagctgag 1020 
tatgaaaatg atattgcttc tatgaatctt 1080 
aacgccttgg gtctgttccc acccacttca 1140 
tcagccatga ctcctcccta cccgcagttt 1200 
tttatcccag ctctatcagt cggtgccatc 1260 
ctttctcgct ttgctggagc ttcaattaag 1320 
gtgaggatgg tgattatcac tggaccacca 1380 
tatggaaaaa ttaaagaaga aaactttgtt 1440 
catatcagag tgccatcctt tgctgctggc 1500 
aatgaacttc agaatttgtc aagtgcagaa 1560 
gagaatgace aagtggttgt caaaataact 1620 
agaaaaattc aggaaattct gactcaggta 1680 
agtggaccac ctcagtcaag acggaagtaa 1740 



<210> 348 
<211> 579 
<212> PRT 

<213> Homo sapiens 



<400> 348 



Met Asn 


Lys 


Leu 


Tyr 


He 


Gly 


Asn 


Leu 


Ser 


Glu 


Asn 


Ala 


Ala Pro 


Ser 


1 






5 










10 








15 




Asp Leu 


Glu 


Ser 


He 


Phe 


Lys 


Asp 


Ala 


Lys 


He 


Pro 


Val 


Ser Gly 


Pro 






20 










25 










30 




Phe Leu 


Val 


Lys 


Thr 


Gly 


Tyr 


Ala 


Phe 


Val 


Asp 


Cys 


Pro 


Asp Glu 


Ser 




35 










40 










45 






Trp Ala 


Leu 


Lys 


Ala 


He 


Glu 


Ala 


Leu 


Ser 


Gly 


Lys 


He 


Glu Leu 


His 


50 










55 










60 








Gly Lys 


Pro 


He 


Glu 


Val 


Glu 


His 


Ser 


Val 


Pro 


Lys 


Arg 


Gin Arg 


He 


65 








70 










75 








80 


Arg Lys 


Leu 


Gin 


He 


Arg 


Asn 


He 


Pro 


Pro 


His 


Leu 


Gin 


Trp Glu 


Val 








85 










90 








95 




Leu Asp 


Ser 


Leu 


Leu 


Val 


Gin 


Tyr 


Gly 


Val 


Val 


Glu 


Ser 


Cys Glu 


Gin 
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Val Asn 

■i 

Lys Asp 
130 
Glu Asn 
145 

Gin Gin 

Arg Gly 

Pro Cys 

Ala He 
210 
Thr Gin 
225 

Glu Lys 

Cys Lys 

Phe Thr 

Gly Arg 
290 
Asp Thr 
305 

Tyr Asn 

Ala Lys 

Asn Asp 

Asn Leu 
370 
Pro Thr 
385 

Glu Gin 
Val Gly 
Arg Phe 



Ala Lys 
450 
Lys Ala 
465 

Ser Pro 

Phe Ala 

Leu Gin 

Pro Asp 
530 
Ala Cys 
545 

Lys Gin 



100 
Thr Asp 
115 

Gin Ala 

Phe Thr 

Asn Pro 

Ser Ser 
180 
Asp Leu 
195 

He Gly 

Ser Lys 

Ser He 

Ser He 
260 
Glu Glu 
275 

Leu He 

Asp Thr' 

Pro Glu 

Ala Glu 
340 
He Ala 
355 

Asn Ala 

Ser Gly 

Ser Glu 

Ala He 
420 
Ala Gly 
435 . 
Val Arg 



Ser Glu 

Arg Gin 

Leu Lys 
150 
Leu Gin 
165 

Arg Gin 

Pro Leu 

Lys Glu 

He Asp 
230 
Thr He 
245 

Leu Glu 

He Pro 

Gly Lys 

Lys He 
310 
Arg Thr 
325 

Glu Glu 

Ser Met 

Leu Gly 

Pro Pro 
390 
Thr Glu 
405 

He Gly 



Ala Ser 
Met Val 



Gin Gly 

Lys Glu 

Ala Gly 
500 
Asn Leu 
515 

Glu Asn 
Gin Val 
His Gin 



Arg He 
470 
Glu Val 
485 

Arg Val 

Ser Ser 

Asp Gin 

Ala Gin 
550 
Gin Gin 



105 

Thr Ala Val 

120 
Ala Leu Asp 
135 

Val Ala Tyr 

Gin Pro Arg 

Gly Ser Pro 
185 

Arg Leu Leu 

200 
Gly Ala Thr 
215 

Val His Arg 

Leu Ser Thr 

He Met His 
265 

Leu Lys He 

280 
Glu Gly Arg 
295 

Thr He Ser 

He Thr Val 

He Met Lys 
345 

Asn Leu Gin 

360 
Leu Phe Pro 
375 

Ser Ala Met 

Thr Val His 

Lys Gin Gly 
425 

He Lys He 

440 
He He Thr 
455 

Tyr Gly Lys 



Val Asn Val Thr 
125 

Lys Leu Asn Gly 
140 

He Pro Asp Glu 
155 

Gly Arg Arg Gly 
170 

Gly Ser Val Ser 

Val Pro Thr Gin 
205 

He Arg Asn He 
220 

Lys Glu Asn Ala 
235 

Pro Glu Gly Thr 
250 

Lys Glu Ala Gin 

Leu Ala His Asn 
285 

Asn Leu Lys Lys 
300 

Pro Leu Gin Glu 
315 

Lys Gly Asn Val 
330 

Lys He Arg Glu 

Ala His Leu He 
365 

Pro Thr Ser Gly 

( 380 
Thr Pro Pro Tyr 
395 

Leu Phe He Pro 
410 

Gin His He Lys 



Ala Pro Ala Glu 
445 

Gly Pro Pro Glu 
460 

He Lys Glu Glu 
475 

Lys Leu Glu Ala His He Arg 
490 

Gly Gly Lys Thr 



He Gly Lys 
505 

Ala Glu Val 

520 
Val Val Val 
535 

Arg Lys He 
Lys Ala Leu 



Val Val Pro Arg 
525 

Lys He Thr Gly 
540. 

Gin Glu He Leu 
555 

Gin Ser Gly Pro 



110 

Tyr Ser Ser 

Phe Gin Leu 

Thr Ala Ala 
160 

Leu Gly Gin 

175 
Lys Gin Lys 
190 

Phe Val Gly 

Thr Lys Gin 

Gly Ala Ala 
240 

Ser Ala Ala 

255 
Asp He Lys 
270 

Asn Phe Val 

He Glu Gin 

Leu Thr Leu 
320 

Glu Thr Cys 

335 
Ser Tyr Glu 
350 

Pro Gly Leu 

Met Pro Pro 

Pro Gin Phe 
400 

Ala Leu Ser 
415 

Gin Leu Ser 
430 

Ala Pro Asp 

Ala Gin Phe 

Asn Phe Val 
480 

Val Pro Ser 
495 

Val Asn Glu 
510 

Asp Gin Thr 

His Phe Tyr 

Thr Gin Val 
560 

Pro Gin Ser 
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565 



Arg Arg Lys 



570 



575 



<210> 349 
<211> 207 
<212> DNA 
<213> Homo sapiens 

<400> 349 

atgtggcagc ccctcttctt caagtggctc ttgtcctgtt gccctgggag ttctcaaatt 60 
gctgcagcag cctccaccca gcctgaggat gacatcaata cacagaggaa gaagagtcag 120 
gaaaagatga gagaagttac agactctcct gggcgacccc gagagcttac cattcctcag 180 
acttcttcac atggtgctaa cagattt 207 

<210> 350 

<211> 69 

<212> PRT 

<213> Homo sapiens 

<400> 350 

Met Trp Gin Pro Leu Phe Phe Lys Trp Leu Leu Ser Cys Cys Pro Gly 

15 10 15 

Ser Ser Gin lie Ala Ala Ala Ala Ser Thr Gin Pro Glu Asp Asp lie 

20 25 30 

Asn Thr Gin Arg Lys Lys Ser Gin Glu Lys Met Arg Glu Val Thr Asp 

35 40 45 

Ser Pro Gly Arg Pro Arg Glu Leu Thr He Pro Gin Thr Ser Ser His 

50 55 60 

Gly Ala Asn Arg Phe 
65 



<210> 351 

<211> 1012 

<212> DNA 

<213> Homo sapiens 

<400> 351 

ccctctagaa ataattttgt ttaactttaa 
catcacacgg ccgcgtccga taacttccag 
ccgatcgggc aggcgatggc gatcgcgggc 
cctaccgcct tcctcggctt gggtgttgtc 
cgcgtggtcg ggagcgctcc ggcggcaagt 
gcggtcgacg gcgctccgat caactcggcc 
catcccggtg acgtcatctc ggtgacctgg 
aacgtgacat tggccgaggg acccccggcc 
ttcatcgggg gtgtcaacaa acactccacc 
tttattttcc gagtcatgat cctcgtggtg 
gaggacttcg tctgcaacac actgcaaccg 
ttcccggtgt cccacatccg gctgtgggcc 
ctgctggtgg ccatgcatgt ggcctactac 
ggagagaaga ggaatgattt caaagacata 
gaggggtgac tcgagcacca ccaccaccac 
aaggaagctg agttggctgc tgccaccgct 
tctaaacggg tcttgagggg ttttttgctg 



gaaggagata tacatatgca tcaccatcac 60 
ctgtcccagg gtgggcaggg attcgccatt 120 
cagatcaagc ttcccaccgt tcatatcggg 180 
gacaacaacg gcaacggcgc acgagtccaa 240 
ctcggcatct ccaccggcga cgtgatcacc 300 
accgcgatgg cggacgcgct taacgggcat 360 
caaaccaagt cgggcggcac gcgtacaggg 420 
gaattcatgg attgggggac gctgcacact 480 
agcatcggga 'aggtgtggat cacagtcatc 540 
gctgcccagg aagtgtgggg tgacgagcaa 600 
ggatgcaaaa atgtgtgcta t;gaccacttt 660 
ctccagctga tcttcgtctc caccccagcg 720 
aggcacgaaa ccactcgcaa gttcaggcga 780 
gaggacatta aaaagcagaa ggttcggata 840 
cactgagatc cggctgctaa caaagcccga 900 
gagcaataac tagcataacc ccttggggcc 960 
aaaggaggaa ctatatccgg at 1012 
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<210> 352 ' 
<211> 267 
>; <212> PRT 
<213> Homo sapiens 

<400> 352 

Met His His His His His His Thr Ala Ala Ser Asp Asn Phe Gin Leu 

1 5 10 15 

Ser Gin Gly Gly Gin Gly Phe Ala lie Pro He Gly Gin Ala Met Ala 

20 25 30 

lie Ala Gly Gin He Lys Leu Pro Thr Val His He Gly Pro Thr Ala 

35 40 45 

Phe Leu Gly Leu Gly Val Val Asp Asn Asn Gly Asn Gly Ala Arg Val 

50 55 60 

Gin Arg Val Val Gly Ser Ala Pro Ala Ala Ser Leu Gly He Ser Thr 
65 70 75 80 

Gly Asp Val He Thr Ala Val Asp Gly Ala Pro He Asn Ser Ala Thr 

85 90 95 

Ala Met Ala Asp Ala Leu Asn Gly His His Pro Gly Asp Val He Ser 

100 105 no 

Val Thr Trp Gin Thr Lys Ser Gly Gly Thr Arg Thr Gly Asn Val Thr 

115 120 125 

Leu Ala Glu Gly Pro Pro Ala Glu Phe Met Asp Trp Gly Thr Leu His 

130 135 140 

Thr Phe lie Gly Gly Val Asn Lys His Ser Thr Ser He Gly Lys Val 
I 45 150 155 16O 

Trp He Thr Val He Phe He Phe Arg Val Met He Leu Val Val Ala 

165 170 175 

Ala Gin Glu Val Trp Gly Asp Glu Gin Glu Asp Phe Val Cys Asn Thr 

180 185 190 

Leu Gin Pro Gly Cys Lys Asn Val Cys Tyr Asp His Phe Phe Pro Val 

195 200 205 

Ser His lie Arg Leu Trp Ala Leu Gin Leu He Phe Val Ser Thr Pro 

210 215 220 

Ala Leu Leu Val Ala Met His Val Ala Tyr Tyr Arg His -Glu Thr Thr 
225 230 235 " 240 

Arg Lys Phe Arg Arg Gly Glu Lys Arg Asn Asp Phe Lys Asp He Glu 

245 250 255 

Asp lie Lys Lys Gin Lys Val Arg He Glu Gly 
260 265 



<210> 353 
<211> 900 
<212> DNA 
<213> Homo sapiens 

<400> 353 

atgcatcacc atcaccatca 
cagggattcg ccattccgat 
accgttcata tcgggcctac 
ggcgcacgag tccaacgcgt 
ggcgacgtga tcaccgcggt 
gcgcttaacg ggcatcatcc 
ggcacgcgta cagggaacgt 
actcgcaagt tcaggcgagg 
aagcagaagg ttcggataga 
cgaatcatct ttgaagcagc 



cacggccgcg tccgataact 
cgggcaggcg atggcgatcg 
cgccttcctc ggcttgggtg 
ggtcgggagc gctccggcgg 
cgacggcgct ccgatcaact 
cggtgacgtc atctcggtga 
gacattggcc gagggacccc 
agagaagagg aatgatttca 
ggggtcgctg tggtggacgt 
ctttatgtat gtgttttact 



tccagctgtc ccagggtggg 60 
cgggccagat caagcttccc 120 
ttgtcgacaa caacggcaac 180 
caagtctcgg catctccacc 240 
cggccaccgc gatggcggac 300 
cctggcaaac caagtcgggc 360 
cggccgaatt ccacgaaacc 420 
aagacataga ggacattaaa 480 
acaccagcag catctttttc 540 
tcctttacaa tgggtaccac 600 
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ctgccctggg tgttgaaatg 
tctaggccaa cagagaagac 
atgctgctta acgtggcaga 
aagagagcac agacgcaaaa 
gaaatgaatg agctgatttc 



tgggattgac ccctgcccca 
cgtgtttacc atttttatga 
gttgtgctac ctgctgctga 
aaatcacccc aatcatgccc 
agatagtggt caaaatgcaa 



accttgttga ctgctttatt 660 
tttctgcgtc tgtgatttgc 720 
aagtgtgttt taggagatca 780 
taaaggagag taagcagaat 840 
tcacaggttt cccaagctaa 900 



<210> 354 
<211> 299 
<212> PRT 
<213> Homo sapiens 



<400> 354 



Met 


His 


His His 


His His His 


Thr 


1 






5 




Ser 


Gin 


Gly Gly 


Gin Gly Phe Ala 






20 






lie 


Ala 


Gly Gin 


He Lys Leu 


Pro 






35 




40 


Phe 


Leu 


Gly Leu 


Gly Val Val 


.Asp 




50 




55 




Gin 


Arg 


Val Val 


Gly Ser Ala 


Pro 


65 






70 




Gly 


Asp 


Val He 


Thr Ala Val 


Asp 








85 




Ala 


Met 


Ala Asp 


Ala Leu Asn 


Gly 






100 






Val 


Thr 


Trp Gin 


Thr Lys Ser 


Gly 






115 




120 


Leu 


Ala 


Glu Gly 


Pro Pro Ala 


Glu 




130 




135 




Arg 


Arg 


Gly Glu 


Lys Arg Asn 


Asp 


145 






150 




Lys 


Gin 


Lys Val 


Arg He Glu 


Gly 








165 




Ser 


lie 


Phe Phe 


Arg He He 


Phe 






180 






Tyr 


Phe 


Leu Tyr 


Asn Gly Tyr 


His 






195 




200 


lie 


Asp 


Pro Cys 


Pro Asn Leu 


Val 




210 




215 




Glu 


Lys 


Thr Val 


Phe Thr He 


Phe 


225 






230 




Met 


Leu 


Leu Asn 


Val Ala Glu 


Leu 








245 




Phe 


Arg 


Arg Ser 


Lys Arg Ala 


Gin 






260 






Ala 


Leu 


Lys Glu 


Ser Lys Gin 


Asn 






275 




280 


Ser 


Gly 


Gin Asn 


Ala He Thr 


Gly 




290 




295 





Ala 


Ala 


Ser 


Asp 


Asn 


Phe 


Gin 


Leu 




10 










15 




He 


Pro 


He 


Gly 


Gin 


Ala 


Met 


Ala 


25 










30 






Thr 


Val 


His 


He 


Gly 


Pro 


Thr 


Ala 










45 








Asn 


Asn 


Gly 


Asn 


Gly 


Ala 


Arg 


Val 








60 










Ala 


Ala 


Ser 


Leu 


Gly 


He 


Ser 


Thr 






75 










80 


Gly 


Ala 


Pro 


He 


Asn 


Ser 


Ala 


Thr 




90 










95 




His 


His 


Pro 


Gly 


Asp 


Val 


He 


Ser 


105 










110 






Gly 


Thr 


Arg 


Thr 


Gly 


Asn 


Val 


Thr 










125 








Phe 


His 


Glu 


Thr 


Thr 


Arg 


Lys 


Phe 








140 










Phe 


Lys* 


Asp 


He 


Glu 


Asp 


He 


Lys 






155 










160 


Ser 


Leu 


Trp 


Trp 


Thr 


Tyr 


Thr 


Ser 




170 










175 




Glu 


Ala 


Ala 


Phe 


Met 


Tyr 


Val 


Phe 


185 










190 






Leu 


Pro 


Trp 


Val 


Leu 


Lys 


Cys 


Gly 










205 








Asp 


Cys 


Phe 


He 


Ser 


Arg 


Pro 


Thr 








220 










Met 


He 


Ser 


Ala 


Ser 


Val 


He 


Cys 






235 










240 


Cys 


Tyr 


Leu 


Leu 


Leu 


Lys 


Val 


Cys 




250 










255 




Thr 


Gin 


Lys 


Asn 


His 


Pro 


Asn 


His 


265 










270 






Glu 


Met 


Asn 


Glu 


Leu 


He 


Ser 


Asp 



285 



Phe Pro Ser 



<210> 355 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> PCR primer 
<400> 355 

ggagtacagc ttcaagacaa tggg 

<210> 356 
<211> 31 
<212> DNA 
/ <213> Artificial Sequence 

<220> 

<223> PCR primer 



<400> 356 

ccatgggaat tcattataat aattttgttc c 31 

<210> 357 
<211> 920 
<212> PRT 
<213> Homo sapiens 



<400> 357 

Met Gin His His His His His His Gly Val Gin Leu Gin Asp Asn Gly 

1 5 10 15 

Tyr Asn Gly Leu Leu lie Ala He Asn Pro Gin Val Pro Glu Asn Gin 

20 25 30 

Asn Leu He Ser Asn He Lys Glu Met He Thr Glu Ala Ser Phe Tyr 

35 40 45 

Leu Phe Asn Ala Thr Lys Arg Arg Val Phe Phe Arg Asn He Lys He 

50 55 60 

Leu He Pro Ala Thr Trp Lys Ala Asn Asn Asn Ser Lys He Lys Gin 
65 70 75 ' 80 

Glu Ser Tyr Glu Lys Ala Asn Val He Val Thr Asp Trp Tyr Gly Ala 

85 90 , ~ 95 

His Gly Asp Asp Pro Tyr Thr Leu Gin Tyr Arg Gly Cys Gly Lys Glu 

100 105 no 

Gly Lys Tyr He His Phe Thr Pro Asn Phe Leu Leu Asn Asp Asn Leu 

115 120 125 

Thr Ala Gly Tyr Gly Ser Arg Gly Arg Val Phe Val His Glu Trp Ala 

130 135 140 

His Leu Arg Trp Gly Val Phe Asp Glu Tyr Asn Asn Asp Lys Pro Phe 
145 150 155 ~ 160 

Tyr He Asn Gly Gin Asn Gin He Lys Val . Thr Arg Cys Ser Ser Asp 

165 170 " 175 

He Thr Gly He Phe Val Cys Glu Lys Gly Pro Cys Pro Gin Glu Asn 

180 185 190 

Cys He He Ser Lys Leu Phe Lys Glu Gly Cys Thr Phe He Tyr Asn 

1^5 200 205 

Ser Thr Gin Asn Ala Thr Ala Ser He Met Phe Met Gin Ser Leu Ser 

210 215 220 

Ser Val Val Glu Phe Cys Asn Ala Ser Thr His Asn Gin Glu Ala Pro 
225 230 235 240 

Asn Leu Gin Asn Gin Met Cys Ser Leu Arg Ser Ala Trp Asp Val He 

245 250 * 255 

Thr Asp Ser Ala Asp Phe His His Ser Phe Pro Met Asn Gly Thr Glu 

260 265 270 

Leu Pro Pro Pro Pro Thr Phe Ser Leu Val Glu Ala Gly Asp Lys Val 
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275 280 285 

Val Cys Leu Val Leu Asp Val Ser Ser Lys Met Ala Glu Ala Asp Arg 
* 290 295 300 

Leu Leu Gin Leu Gin Gin Ala Ala Glu Phe Tyr Leu Met Gin He Val 
305 310 315 320 

Glu He His Thr Phe Val Gly He Ala Ser Phe Asp Ser Lys Gly Glu 

325 330 335 

He Arg Ala Gin Leu His Gin He Asn Ser Asn Asp Asp Arg Lys Leu 

340 345 350 

Leu Val Ser Tyr Leu Pro Thr Thr Val Ser Ala Lys Thr Asp He Ser 

355 360 365 

He Cys Ser Gly Leu Lys Lys. Gly Phe Glu Val Val Glu Lys Leu Asn 

370 375 380 

Gly Lys Ala Tyr Gly Ser Val Met He Leu Val Thr Ser Gly Asp Asp 
385 390 395 ^ 400 

Lys Leu Leu Gly Asn Cys Leu Pro Thr Val Leu Ser Ser Gly Ser Thr 

405 410 415 

He His Ser- He Ala Leu Gly Ser Ser Ala Ala Pro Asn Leu Glu Glu 

420 425 430 

Leu Ser Arg Leu Thr Gly Gly Leu Lys Phe Phe Val Pro Asp He Ser 

435 440 445 

Asn Ser Asn Ser Met He Asp Ala Phe Ser Arg lie Ser Ser Gly Thr 

450 455 460 

Gly Asp He Phe Gin Gin His He Gin Leu Glu Ser Thr Gly Glu Asn 
465 470 475 480 

Val Lys Pro His His Gin Leu Lys Asn Thr Val Thr Val Asp Asn Thr 

485 490 495 

Val Gly Asn Asp Thr Met Phe Leu Val Thr Trp Gin Ala Ser Gly Pro 

500 505 510 

Pro Glu He He Leu Phe Asp Pro Asp Gly Arg Lys Tyr Tyr Thr Asn 

515 520 525 

Asn Phe He Thr Asn Leu Thr Phe Arg Thr Ala Ser Leu Trp He Pro 

530 535 540 

Gly Thr Ala Lys Pro Gly His Trp Thr Tyr Thr Leu Asn Asn Thr His 
545 550 55^ 560 

His Ser Leu Gin Ala Leu Lys Val Thr Val Thr Ser Arg Ala Ser Asn 

565 . 570 575 

Ser Ala Val Pro Pro Ala Thr Val Glu Ala Phe Val Glu Arg Asp Ser 

580 585 590 

Leu His Phe Pro His Pro Val Met He Tyr Ala Asn Val Lys Gin Gly 

595 600 605 

Phe Tyr Pro He Leu Asn Ala Thr Val Thr Ala Thr Val Glu Pro Glu 

610 615 620 

Thr Gly Asp Pro Val Thr Leu Arg Leu Leu Asp Asp Gly Ala Gly Ala 
625 630 635 640 

Asp Val He Lys Asn Asp Gly He Tyr Ser Arg Tyr Phe Phe Ser Phe 

645 650 655 

Ala Ala Asn Gly Arg Tyr Ser Leu Lys Val His Val Asn His Ser Pro 

660 665 670 

Ser He Ser Thr Pro Ala His Ser He Pro Gly Ser His Ala Met Tyr 

675 680 685 , 

Val Pro Gly Tyr Thr Ala Asn Gly Asn He Gin Met Asn Ala Pro Arg 

690 695 700 

Lys Ser Val Gly Arg Asn Glu Glu Glu Arg Lys Trp Gly phe Ser Arg 
705 710 715 720 

Val Ser Ser Gly Gly Ser Phe Ser Val Leu Gly Val Pro Ala Gly Pro 

725 730 735 

His Pro Asp Val Phe Pro Pro Cys Lys He He Asp Le u Glu Ala Val 
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161 



740 

Lys Val Glu Glu 
{ 755 
Phe Asp Gin Gly 
770 

Leu Gin Asn He 
785 

Ser Lys Arg Asn 

Ser Pro Gin He 
820 

Thr His Glu Ser 
835 

Asn Ser Leu Gin 
850 

He Pro Pro Asn 
865 

Lys Gly Val Leu 



Glu Leu 

Gin Ala 

Gin Asp 
790 
Pro Gin 
805 

Ser Thr 

His Arg 

Ser Ala 

Ser Asp 
870 
Thr Ala 
885 

His His 



He Val Val Thr 
900 

Lys Glu Asn Gly Thr Lys 
915 



Thr Leu 
760 
Thr Ser 
775 

Asp Phe 

Gin Ala 

Asn Gly 

He Tyr 
840 
Val Ser 
855 

Pro Val 

Met Gly 

Thr Leu 

Leu Leu 
920 



745 

Ser Trp Thr 

Tyr Glu He 

Asn Asn Ala 
795 

Gly He Arg 

810 
Pro Glu His 
825 

Val Ala He 

Asn He Ala 

Pro Ala Arg 
875 

Leu He Gly 

890 
Ser Arg Lys 
905 



Ala Pro 
7 65 
Arg Met 
780 

He Leu 

Glu He 

Gin Pro 

Arg Ala 
845 
Gin Ala 
860 

Asp Tyr 
He He 
Lys Arg 



750 

Gly Glu Asp 

Ser Lys Ser 

Val Asn Thr 
800 

Phe Thr Phe 

815 
Asn Gly Glu 
830 ' 

Met Asp Arg 

Pro Leu Phe 

Leu He Leu 
880 

Cys Leu He 

895 
Ala Asp Lys 
910 



<210> 358 
<211> 2773 
<212> DNA 

<213> Homo sapiens 
<400> 358 

catatgcagc atcaccacca tcaccacgga 
ttgctcattg caattaatcc tcaggtacct 
gaaatgataa ctgaagcttc attttaccta 
agaaatataa agattttaat acctgccaca 
caagaatcat atgaaaaggc aaatgtcata 
gatccataca ccctacaata cagagggtgt 
cctaatttcc tactgaatga taacttaaca 
gtccatgaat gggcccacct ccgttggggt 
ttctacataa atgggcaaaa tcaaattaaa 
atttttgtgt gtgaaaaagg tccttgcccc 
aaagaaggat gcacctttat ctacaatagc 
atgcaaagtt tatcttctgt ggttgaattt 
ccaaacctac agaaccagat gtgcagcctc 
gctgactttc accacagctt tcccatgaac 
tcgcttgtag aggctggtga caaagtggtc 
gcagaggctg acagactcct tcaactacaa 
gttgaaattc ataccttcgt gggcattgcc 
cagctacacc aaattaacag caatgatgat 
actgtatcag ctaaaacaga catcagcatt 
gttgaaaaac tgaatggaaa agcttatggc 
gataagcttc ttggcaattg cttacccact 
attgccctgg gttcatctgc agccccaaat 
ttaaagttct ttgttccaga tatatcaaac 
atttcctctg gaactggaga cattttccag 
aatgtcaaac ctcaccatca attgaaaaac 
gacactatgt ttctagttac gtggcaggcc 
cctgatggac gaaaatacta cacaaataat 



gtacagcttc aagacaatgg gtataatgga 60 
gagaatcaga acctcatctc aaacattaag 120 
tttaatgcta ccaagagaag agtatttttc 180 
tggaaagcta ataataacag caaaataaaa 240 
gtgactgact ggtatggggc acatggagat 300 
ggaaaagagg gaaaatacat tcatttcaca 360 
gctggctacg gatcacgagg ccgagtgttt 420 
gtgttcgatg agtataacaa tgacaaacct 480 
gtgacaaggt gttcatctga catcacaggc 540 
caagaaaact gtattattag taagcttttt 600 
acccaaaatg caactgcatc aataatgttc 660 
tgtaatgcaa gtacccacaa ccaagaagca 720 
agaagtgcat gggatgtaat cacagactct 780 
gggactgagc ttccacctcc tcccacattc 840 
tgtttagtgc tggatgtgtc cagcaagatg 900 
caagccgcag aattttattt gatgcagatt 960 
agtttcgaca gcaaaggaga gatcagagcc 1020 
cgaaagttgc tggtttcata tctgcccacc 1080 
tgttcagggc ttaagaaagg atttgaggtg 1140 
tctgtgatga tattagtgac qagcggagat 1200 
gtgctcagca gtggttcaac aattcactcc 1260 
ctggaggaat tatcacgtct tacaggaggt 1320 
tccaatagca tgattgatgc tttcagtaga 1380 
caacatattc agcttgaaag tacaggtgaa 1440 
acagtgactg tggataatac tgtgggcaac 1500 
agtggtcctc ctgagattat attatttgat 1560 ' 
tt tat caeca atctaacttt teggacaget 1620 
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agtctttgga ttccaggaac agctaagcct gggcactgga cttacaccct gaacaatacc 1680 

catcattctc tgcaagccct gaaagtgaca gtgacctctc gcgcctccaa ctcagctgtg 1740 

^cccccagcca ctgtggaagc ctttgtggaa agagacagcc tccattttcc tcatcctgtg 1800 

atgatttatg ccaatgtgaa acagggattt tatcccattc ttaatgccac tgtcactgcc 1860 

acagttgagc cagagactgg agatcctgtt acgctgagac tccttgatga tggagcaggt 1920 

gctgatgtta taaaaaatga tggaatttac tcgaggtatt ttttctcctt tgctgcaaat 1980 

ggtagatata gcttgaaagt gcatgtcaat cactctccca gcataagcac cccagcccac 2040 

tctattccag ggagtcatgc tatgtatgta ccaggttaca cagcaaacgg taatattcag 2100 

atgaatgctc caaggaaatc agtaggcaga aatgaggagg agcgaaagtg gggctttagc 2160 

cgagtcagct caggaggctc cttttcagtg ctgggagttc cagctggccc ccaccctgat 2220 

gtgtttccac catgcaaaat tattgacctg gaagctgtaa aagtagaaga ggaattgacc 2280 

ctatcttgga cagcacctgg agaagacttt gatcagggcc aggctacaag ctatgaaata 2340 

agaatgagta aaagtctaca gaatatccaa gatgacttta acaatgctat tttagtaaat 2400 

acatcaaagc gaaatcctca gcaagctggc atcagggaga tatttacgtt ctcaccccaa 2460 

atttccacga atggacctga acatcagcca aatggagaaa cacatgaaag ccacagaatt 2520 

tatgttgcaa tacgagcaat ggataggaac tccttacagt ctgctgtatc taacattgcc 2580 

caggcgcctc tgtttattcc ccccaattct gatcctgtac ctgccagaga ttatcttata 2640 

ttgaaaggag ttttaacagc aatgggtttg ataggaatca tttgccttat tatagttgtg 2700 

acacatcata ctttaagcag gaaaaagaga gcagacaaga aagagaatgg aacaaaatta 2760 

ttataatgaa ttc 2773 

<210> 359 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 359 

tggcagcccc tcttcttcaa gtggc 25 

<210> 360 
<211> 33 

<212> DNA , 
<213> Artificial Sequence 



<220> 

<223> PCR primer 
<400> 360 

cgccagaatt catcaaacaa atctgttagc acc 

<210> 361 
<211> 77 
<212> PRT 

<213> Homo sapiens 



33 



<400> 361 

Met Gin His His His His His His Trp Gin Pro Leu Phe Phe Lys Trp 

1 5 10 ,15 

Leu Leu Ser Cys Cys Pro Gly Ser Ser Gin lie Ala Ala Ala Ala Ser 

20 25 30 

Thr Gin Pro Glu Asp Asp lie Asn Thr Gin Arg Lys Lys Ser Gin Glu 

35 40 45 

Lys Met Arg Glu Val Thr Asp Ser Pro Gly Arg Pro Arg Glu Leu Thr 

50 55 60 

He Pro Gin Thr Ser Ser His Gly Ala Asn Arg Phe Val 
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65 70 75 



<210> 362 
<211> 244 
<212> DNA 

<213> Homo sapiens 
<400> 362 

catatgcagc atcaccacca tcaccactgg cagcccctct tcttcaagtg gctcttgtcc 60 
tgttgccctg ggagttctca aattgctgca gcagcctcca cccagcctga ggatgacatc 120 
aatacacaga ggaagaagag tcaggaaaag atgagagaag ttacagactc tcctgggcga 180 
ccccgagagc ttaccattcc tcagacttct tcacatggtg ctaacagatt tgtttgatga 240 
attc " 244 

<210> 363 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 363 

Met Trp Gin Pro Leu Phe Phe Lys Trp Leu Leu Ser Cys Cys Pro Gly 

1 5 10 15 

Ser Ser Gin lie 
20 



<210> 364 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<400> 364 

atgtggcagc ccctcttctt caagtggctc ttgtcctgtt gccctgggag ttctcaaatt 60 

<210> 365 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 365 

Gly Ser Ser Gin lie Ala Ala Ala Ala Ser Thr Gin Pro Glu Asp Asp 

1 5 10 is 

He Asn Thr Gin 
20 



<210> 366 
<211> 60 
<212> DNA 

<213> Homo sapiens 
<400> 366 

gggagttctc aaattgctgc agcagcctcc acccagcctg aggatgacat caatacacag 60 
<210> 367 
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<211> 20 
<212> PRT 

V213> Homo sapiens 
<400> 367 

Lys Pro Gly His Trp Thr Tyr Thr Leu Asn Asn Thr His His Ser Leu 

1 5 10 15 

Gin Ala Leu Lys 
20 



<210> 368 

<211> 2343 

<212> DNA 

<213> Homo sapiens 

<400> 368 

attccggagc gtttgcggct tcgcttcatg 
gggagctggg gagcccgcag cggcccggag 
gcgccgcgcc tctgaggcgc agcatgtgaa 
tctcagccgg ccgggatggc taccacggcc 
gaatatattg aacgtcttgt atggagaacc 
gcttttgatc ctaaaagatt attagaagaa 
atggatgaaa ggattcagag gaaagtagag 
aaggaatttg ccaagaaggt acaagagctg 
ttccaagaac tagatgagca cattagctat 
cagttagagg gggtaaacac acccagacaa 
tactttaatg agtttctaga tggagaattg 
ataaaggaag cagcagacat cattcagaag 
gatagatttt cagaagttaa atccaaaatt 
ctgattcagg agtttaccag tgctcaaaga 
gcagcagttt tacttcattt taagggttat 
tgccaggagg gtgcttattt gagaaatgat 
agagtgaaca aacaagttgg agatatcttc 
attcaaaatg tatttgaaat caaactacag 
aggaagtccg atgcagagca atatctcaaa 
aatctttcca gcaagctgat ggagtttaat 
aagcttatca aatccatttt catttcctat 
tatttgaaaa gcagaagtgc tatgatccta 
aagagatcca ttggcacagg aggtattcaa 
aacttaccac ttgggccaag tat.cgatact 
gtggttaatc ttttacaaga aaccaaacaa 
ccttctgact taccaaggaa tgccttcaga 
attgagcata ttgattatgc tttggaaaca 
aggaatgcaa atctttattt tttggacgtt 
tttgacaaac agtttaatga tcaccttatg 
gaatgccttc agaagaaaaa agaaataatt 
attgatagga cattaaattg tatgattgga 
aagaaaacag attttaagcc agaagatgaa 
tgtgtaaaag tctgtgctta cgtaagaaaa 
gggaagaatg tggatacagt tttgatggaa 
gagcatcttc aacaatattc ctacagttgt 
gccgaatata ggaagtgtgc caaagacttc 
actctgcatg ctctttgcaa tcttctggta 
tcaggagaac aacttgctaa tctggacaag 
gctgattata gatctgcccg ccttgctcga 
att 



gccgctctcc cgcccctcct gggatctgtg 60 
ccggagctgg cgagccgagc ggagacctgt 120 
gcggagacgg catccagtgg ggggcgagcc 180 
gagctcttcg aggagccttt tgtggcagat 240 
ccaggaggag gctctagagg tggacctgaa 300 
tttgtaaatc atattcagga actccagata 360 
aaactagagc aacaatgtca gaaagaagcc 420 
cagaaaagca atcaggttgc cttccaacat 480 
gtagcaacta aagtctgtca ccttggagac 540 
cgggcagtgg aggctcagaa attgatgaaa 600 
aaatctgatg tttttacaaa ttctgaaaag 660 
ttgcacctaa ttgcccaaga gttacctttt 720 
gcaagtaaat accatgattt agaatgccag 780 
agaggtgaaa tctccagaat gagagaagta 840 
tcccattgtg ttgatgttta tataaagcag 900 
atatttgaag acgctggaat actctgtcaa 960 
agtaatccag aaacagtcct ggctaaactt 1020 
agttttgtga aagagcagtt agaagaatgt 1080 
aatctctatg atctgtatac aagaaccacc 1140 
ttaggtactg ataaacagac tttcttgtct 1200 
ttggagaact atattgaggt ggagactgga 1260 
cagcgctatt atgattcgaa aaaccatcaa 1320 
gatttgaagg aaagaattag acagcgtacc 1380 
catggggaga cttttctatc ccaagaagtg 1440 
gcctttgaaa gatgtcatag gctctctgat 1500 
atttttacca ttcttgtgga atttttatgt 1560 
ggacttgctg gaattccctc ttcagattct 1620 
gtgcaacagg ccaatactat ttttcatctt 1680 
ccactaataa gctcttctcc taagttatct 1740 
gaacaaatgg agatgaaatt ggatactggc 1800 
cagatgaagc atattttggc tgcagaacag 1860 
aacaatgttt tgattcaata tactaatgcc 1920 
caagtggaga agattaaaaa ttccatggat 1980 
cttggagtac gttttcatcg acttatctat 2040 
atgggtggca tgttggccat ttgtgatgta 2100 
aagattccaa tggtattaca tctttttgat 2160 
gttgccccag ataatttaaa gcaagtctgc 2220 
aatatacttc actccttcgt acaacttcgt 2280 
cacttcagct gagattgaat ttacaaagga 2340 

2343 



WO 02/00174 



PCT/US01/21065 



165 



<210> 369 
<211> 708 
X212> PRT 
<213>Homo sapiens 

<400> 369 

Met Ala Thr Thr Ala Glu Leu Phe Glu Glu Pro Phe Val Ala Asp Glu 

1 5 10 15 

Tyr lie Glu Arg lieu Val Trp Arg Thr Pro Gly Gly Gly Ser Arg Gly 

20 25 30 

Gly Pro Glu Ala Phe Asp Pro Lys Arg Leu Leu Glu Glu Phe Val Asn 

35 40 45 

His He Gin Glu Leu Gin He Met Asp Glu Arg He Gin Arg Lys Val 

50 55 -60 

Glu Lys Leu Glu Gin Gin Cys Gin Lys Glu Ala Lys Glu Phe Ala Lys 
6 5 70 75 80 

Lys Val Gin Glu Leu Gin Lys Ser Asn Gin Val Ala Phe Gin His Phe 

85 90 95 

Gin Glu Leu Asp Glu His He Ser Tyr Val Ala Thr Lys Val Cys His 

100 105 no 

Leu Gly Asp Gin Leu Glu Gly Val Asn Thr Pro Arg Gin Arg Ala Val 

115 120 125 

Glu Ala Gin Lys Leu Met Lys Tyr Phe Asn Glu Phe Leu Asp Gly Glu 

130 135 140 

Leu Lys Ser Asp Val Phe Thr Asn Ser Glu Lys He Lys Glu Ala Ala 
"5 150 155 160 

Asp He He Gin Lys Leu His Leu He Ala Gin Glu Leu Pro Phe Asp 

165 170 175 

Arg Phe Ser Glu Val Lys Ser Lys He Ala Ser Lys Tyr His Asp Leu 

180 185 190 

Glu Cys Gin Leu lie Gin Glu Phe Thr Ser Ala Gin Arg Arg Gly Glu 

195 200 205 

He Ser Arg Met Arg Glu Val Ala Ala Val Leu Leu His Phe Lys Gly 

210 215 220 

Tyr Ser His Cys Val Asp Val Tyr He Lys Gin Cys Gin Glu Gly Ala 
225 230 235 240 

Tyr Leu Arg Asn Asp He Phe Glu Asp Ala Gly He Leu Cys Gin Arg 

245- 250 255 

Val Asn Lys Gin Val Gly Asp He Phe Ser Asn Pro Glu Thr Val Leu 

260 265 270 

Ala Lys Leu He Gin Asn Val Phe Glu He Lys Leu Gin Ser Phe Val 

275 280 285 

Lys Glu Gin Leu Glu Glu Cys Arg Lys Ser Asp Ala Glu Gin Tyr Leu 

290 295 300 

Lys Asn Leu Tyr Asp Leu Tyr Thr Arg Thr Thr Asn Leu Ser Ser Lys 
305 310 315 320 

Leu Met Glu Phe Asn Leu Gly . Thr Asp Lys Gin Thr Phe Leu Ser Lys 

325 330 335 

Leu He Lys Ser He Phe He Ser Tyr Leu Glu Asn Tyr He Glu Val 

340 345 350 

Glu Thr Gly Tyr Leu Lys Ser Arg Ser Ala Met He Leu Gin Arq Tvr 

355 360 365 

Tyr Asp Ser Lys Asn His Gin Lys Arg Ser He Gly Thr Gly Gly He 

370 375 " 380 

Gin Asp Leu Lys Glu Arg He Arg Gin Arg Thr Asn Leu Pro Leu Gly 
385 390 395 400 

Pro Ser He Asp Thr His Gly Glu Thr Phe Leu Ser Gin Glu Val Val 
405 410 415 
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Val Asn Leu Leu Gin Glu Thr Lys Gin Ala Phe Glu Arg Cys His Arg 
. 420 425 430 

"Leu Ser Asp Pro Ser Asp Leu Pro Arg Asn Ala Phe Arg lie Phe Thr 

435 440 445 

lie Leu Val Glu Phe Leu Cys lie Glu His lie Asp Tyr Ala Leu Glu 

450 455 460 

Thr Gly Leu Ala Gly lie Pro Ser Ser Asp Ser Arg Asn Ala Asn Leu 
465 470 475 480 

Tyr Phe Leu Asp Val Val Gin Gin Ala Asn Thr He Phe His Leu Phe 

485 490 495 

Asp Lys Gin Phe Asn Asp His Leu Met Pro Leu He Ser Ser Ser Pro 

500 505 510 

Lys Leu Ser Glu Cys Leu Gin Lys Lys Lys Glu He He Glu Gin Met 

515 520 525 

Glu Met Lys Leu Asp Thr Gly He Asp Arg Thr Leu Asn Cys Met He 

530 535 { 540 

Gly Gin Met Lys His He Leu Ala Ala Glu Gin Lys Lys Thr Asp Phe 
545 550 555 " 560 

Lys Pro Glu Asp Glu Asn Asn Val Leu He Gin Tyr Thr Asn Ala Cys 

565 570 575 

Val Lys Val Cys Ala Tyr Val Arg Lys Gin Val Glu Lys He Lys Asn 

580 585 590 

Ser Met Asp Gly Lys Asn Val Asp Thr Val Leu Met Glu Leu Gly Val 

595 600 605 

Arg Phe His Arg Leu He Tyr Glu His Leu Gin Gin Tyr Ser Tyr Ser 

610 615 • 620 

Cys Met Gly Gly Met Leu Ala He Cys Asp Val Ala Glu Tyr Arg Lys 
625 630 635 640 

Cys Ala Lys Asp Phe Lys He Pro Met Val Leu His Leu Phe Asp Thr 

645 650 655 

Leu His Ala Leu Cys Asn Leu Leu Val Val Ala Pro Asp Asn Leu Lys 

660 665 670 

Gin Val Cys Ser Gly Glu Gin Leu Ala Asn Leu Asp Lys Asn He Leu 

675 680 685 

His Ser Phe Val Gin Leu Arg Ala Asp Tyr Arg Ser Ala Arg Leu Ala 

690 695 700 

Arg His Phe Ser 
705 



<210> 370 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 370 

gtcaatcact ctcccagcat aagcacccca gcccactcta ttccagggag t cat get at g 60 
<210> 371 

<211> 60 i 

<212> DNA 

<213> Homo sapiens 



<400> 371 
agtagaattt 



cctctggaac tggagacatt ttccagcaac atattcagct tgaaagtaca 60 
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<210> 372 
<211> 60 
<<212> DNA 
<213> Homo sapiens 

<400> 372 

ccagagactg gagatcctgt tacgctgaga ctccttgatg atggagcagg tgctgatgtt 60 

<210> 373 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 373 

ttacagtctg ctgtatctaa cattgcccag gcgcctctgt ttattccccc caattctgat 60 

<210> 374 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 374 

gctgtgcccc cagccactgt ggaagccttt gtggaaagag acagcctcca ttttcctcat 60 

<210> 375 

<211> 60 

<212> DNA 

<213> Homo sapiens 

<400> 375 

aaaaacacag tgactgtgga taatactgtg ggcaacgaca ctatgtttct agttacgtgg 60 

i 

<210> 376 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 376 

Leu Gin Ser Ala Val Ser Asn lie Ala Gin Ala Pro Leu Phe lie Pro 

1 5 10 15 

Pro Asn Ser Asp 
20 



<210> 377 
<211> 20 
<212> PRT 

<213> Homo sapiens 



Val Asn His Ser Pro Ser lie Ser Thr Pro Ala His Ser H e Pro Gly 
1 5 10 u 



Ser His Ala Met 
20 
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/ 



"<210> 378 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 378 

Pro GlU Thr Gly Asp Pro Val Thr Leu Arg Leu Leu Asp Asp Gly Ala 

15 10 15 

Gly Ala Asp Val 
20 



<210> 379 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 379 

Ala Val Pro Pro Ala Thr Val Glu Ala Phe Val Glu Arg Asp Ser Leu 

1 5 10 15 

His Phe Pro His 
20 



<210> 380 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 380 

Ser Arg He Ser Ser Gly Thr Gly Asp He Phe Gin Gin His He Gin 

15 10 15 

Leu Glu Ser Thr ■ 
20 



<210> 381 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 381 

Lys Asn Thr Val Thr Val Asp Asn Thr Val Gly Asn Asp Thr Met Phe 

1 5 10 15 

Leu Val Thr Trp 
20 



<210> 382 , 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 382 

Lys Pro Gly His Trp Thr Tyr Thr Leu Asn Asn Thr His His Ser Leu 
15 10 15 
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Gin Ala Leu Lys 
20 



<210> 383 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 383 

cggcgaattc atggattggg ggacgctgc 29 

<210> 384 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 384 

cggcctegag tcacccctct atccgaacct tctgc 35 

<210> 385 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 385 

cggcgaattc cacgaaccac tcgcaagttc ag 32 

<210> 386 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 386 

cggctcgagt tagcttgggc ctgtgattgc 30 

<210> 387 
<211> 20 
<212> PRT 

<213> Homo sapiens ! 
<400> 387 

Phe Phe Lys Trp Leu Leu Ser Cys Cys Pro Gly Ser Ser Gin lie Ala 

1- 5 10 15 

Ala Ala Ala Ser 
20 
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<210> 388 
<211> 19 
<212> PRT 

<213> Homo sapiens 
<400> 388 

Leu Ser Cys Cys Pro Gly Ser Ser Gin lie Ala Ala Ala Ser Thr Gin 

15 10 15 

Pro Glu Asp 



<210> 389 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 389 

Ala Ala Ala Ala Ser Thr Gin Pro Glu Asp Asp lie Asn Thr Gin Arg 

1 5 10 15 

Lys Lys Ser Gin 
20 



<210> 390 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 390 

Thr Gin Pro Glu Asp Asp lie Asn Thr Gin Arg Lys Lys Ser Gin Glu 

15 10 15 

Lys Met Arg Glu 
20 



<210> 391 
<211> 20 • 
<212> PRT 

<213> Homo sapiens 
<400> 391 

Asp lie Asn Thr Gin Arg Lys Lys Ser Gin Glu Lys Met Arg Glu Val ' 

1 5 10 15 

Thr Asp Ser Pro 
20 



<210> 392 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 392 

Arg Lys Lys Ser Gin Glu Lys Met Arg Glu Val Thr Asp Ser Pro Gly 
1 5 10 15 
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Arg Pro Arg Glu 
20 



<210> 393 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 393 

Glu Lys Met Arg Glu Val Thr Asp Ser Pro Gly Arg Pro Arg Glu Leu 

1 5 10 15 

Thr lie Pro Gin 

20 



<210> 394 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 394 

Val Thr Asp Ser Pro Gly Arg Pro Arg Glu Leu Thr lie Pro Gin Thr 

1 5 io 15 

Ser Ser His Gly 
20 



<210> 395 
<211> 19 
<212> PRT 

<213> Homo sapiens 



<400> 395 

Gly Arg Pro Arg Glu Leu Thr He 

1 5 
Asn Arg Phe 



Pro Gin Thr Ser Ser His Gly Ala 
10 15 



<210> 396 
<211> 19 
<212> PRT 

<213> Homo sapiens 



<400> 396 

Met Asn Lys Leu Tyr He Gly Asn 

1 5 
Asp Leu Glu 



Leu Ser Glu Asn Ala Ala Pro Ser 
10 15 



<210> 397 
<211> 20 
<212> PRT 

<213> Homo sapiens 



<400> 397 
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Ser Glu Asn Ala Ala Pro Ser Asp Leu Glu Ser He Phe Lys Asp Ala 
. 1 5 10 15 

Lys He Pro Val 
20 



<210> 398 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 398 

Ser He Phe Lys Asp Ala Lys He Pro Val Ser Gly Pro Phe Leu Val 

15 10 15 

Lys Thr Gly Tyr 
20 



<210> 399 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 399 

Ser Gly Pro Phe Leu Val Lys Thr Gly Tyr Ala Phe Val Asp Cys Pro 

15 10 15 

Asp Glu Ser Trp 
20 



<210> 400 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 400 

Ala Phe Val Asp Cys Pro Asp Glu Ser Trp Ala Leu Lys Ala He Glu 

1 5 10 15 

Ala Leu Ser Gly 
20 



<210> 401 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 401 

Ala Leu Lys Ala He Glu Ala Leu Ser Gly Lys He Glu Leu His Gly 

1 5 10 15 

Lys Pro lie Glu 1 
20 



<210> 402 

<211> 20 

<212> PRT 

<213> Homo sapiens 
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,<400> 402 

n Lys He Glu Leu His Gly Lys Pro He Glu Val Glu His Ser Val Pro 

15 10 15 

Lys Arg Gin Arg 
20 



<210> 403 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 403 

Val Glu His Ser Val Pro Lys Arg Gin Arg He Arg Lys Leu Gin He 

1 5 10 " 15 

Arg Asn He Pro 
20 



<210> 404 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 404 

He Arg Lys Leu Gin He Arg Asn He Pro Pro His Leu Gin Trp Glu 

15 10 15 

Val Leu Asp Ser 
20 



<210> 405 

<211> 20 

<212> PRT 

<213> Homo sapiens 



<400> 405 

Ala Val Val Asn Val Thr Tyr Ser 

1 5 
Leu Asp Lys Leu 
20 



Ser Lys Asp Gin Ala Arg Gin Ala 
10 15 



<210> 406 

<211> 20 

<212> PRT 

<213> Homo sapiens 



<400> 406 

Asp Gin Ala Arg Gin Ala Leu Asp Lys Leu Asn Gly Phe Gin Leu Glu 

1 5 io 15 

Asn Phe Thr Leu 
20 



<210> 407 
<211> 20 
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<212> PRT 

.<213> Homo sapiens 
<400> 407 

Asn Gly Phe Gin Leu Glu Asn Phe Thr Leu Lys Val Ala Tyr lie Pro 

1 5 10 15 

Asp Glu Thr Ala 
20 



<210> 408 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 408 

Lys Val Ala Tyr lie Pro Asp Glu Thr Ala Ala Gin Gin Asn Pro Leu 

15 10 15 

Gin Gin Pro Arg 
20 



<210> 409 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 409 

Ala Gin Gin Asn Pro Leu Gin Gin Pro Arg Gly Arg Arg Gly Leu Gly 

15 10 15 

Gin Arg Gly Ser 
20 



<210> 410 » 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 410 

Gly Arg Arg Gly Leu Gly Gin Arg Gly Ser Ser Arg Gin Gly Ser Pro 

15 10 15 

Gly Ser Val Ser 
20 



<210> 411 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 411 

Ser Arg Gin Gly Ser Pro Gly Ser Val Ser Lys Gin Lys Pro Cys Asp 

1 5 * 10 15 

Leu Pro Leu Arg 
20 
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<210> 412 
<211> 20 
* <212> PRT 
<213> Homo sapiens 

<400> 412 

Lys Gin Lys Pro Cys Asp Leu Pro Leu Arg Leu Leu Val Pro Thr Gin 

1 5-io 15 

Phe Val Gly Ala 
f 20 



<210> 413 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 413 

Leu Leu Val Pro Thr Gin Phe Val 

1 ' 5 

Ala Thr lie Arg 
20 



Gly Ala He He Gly Lys Glu Gly 
10 15 



<210> 414 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 414 

He He Gly Lys Glu Gly Ala Thr He Arg Asn He Thr Lys Gin Thr 

1 5 10 15 

Gin Ser Lys He 
20 



<210> 415 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 415 

Asn He Thr Lys Gin Thr Gin Ser Lys He Asp Val His Arg Lys Glu 

1 5 io 15 

Asn Ala Gly Ala 
20 



<210> 416 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 416 

Asp Val His Arg Lys Glu Asn Ala Gly Ala Ala Glu Lys Ser He Thr 
5 10 15 

He Leu Ser Thr 
20 
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; <210> 417 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 417 

Ala Glu Lys Ser He Thr He Leu Ser Thr Pro Glu Gly Thr Ser Ala 

15 10 15 

Ala Cys Lys Ser 
20 



<210> 418 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 418 

Pro Glu Gly Thr Ser Ala Ala Cys Lys Ser He Leu Glu He Met His 

1 5 10 15 

Lys Glu Ala Gin 
20 



<210> 419 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 419 

He Leu Glu He Met His Lys Glu Ala Gin Asp He Lys Phe Thr Glu 

1 5 10 15 

Glu He Pro Leu 
20 



<210> 420 
<211> 455 
<212> DNA 

<213> Homo sapiens 
.<:400> 420 

gaagacatgc ttacttcccc ttcaccttcc 
gccctagcca acgccgcatg agagggagtg 
catctagaaa gaagcgctta agatgtggca 
ttgccctggg agttctcaaa ttgctgcagc 
tacacagagg aagaagagtc aggaaaagat 
ccgagagctt accattcctc agacttcttc 
taaagctcta gaggccgtca aattggcaat 
acatgtttac aataatgagg agcaggttgg 

<210> 421 
<211> 24 
<212> DKA 

<213> Artificial Sequence 



ttcatgatgt gggaagagtg ctgcaaccca 60 
tgccgagggc ttctgagaag gtttctctca 120 
gcccctcttc ttcaagtggc tcttgtcctg 180 
agcctccacc cagcctgagg atgacatcaa 240 
gagagaagtt acagactctc ctgggcgacc 300 
acatggtgct aacagatttg ttcctaaaag 360 
agaagccggg ttccaccata ^tgattctgc 420 
actgg " 455 
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<220> 

<223> PCR primer 
<400> 421 

actagtgtcc gcgtggcggc ctac 24 

<210> 422 
<211> 34 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 422 

catgagaatt catcacatgc ccttgaaggc tccc 34 

<210> 423 
<211> 161 
<212> PRT 

<213> Homo sapiens 
<400> 423 

Met Gin His His His His His His His Thr Ser Val Arg Val Ala Ala 

1 5 10 15 

Tyr Phe Glu Asn Phe Leu Ala Ala Trp Arg Pro Val Lys Ala Ser Asp 

20 25 30 

Gly Asp Tyr Tyr Thr Leu Ala Val Pro Met Gly Asp Val Pro Met Asp 

35 40 45 

Gly He Ser Val Ala Asp He Gly Ala Ala Val Ser Ser He Phe Asn 

50 55 60 

Ser Pro Glu Glu Phe Leu Gly Lys Ala Val Gly Leu Ser Ala Glu Ala 
65 ™ 75 80 

Leu Thr He Gin Gin Tyr Ala Asp Val Leu Ser Lys Ala Leu Gly Lvs 

85 90 95 

Glu Val Arg Asp Ala Lys He Thr Pro Glu Ala Phe Glu Lys Leu Gly 

100 105 no 

Phe Pro Ala Ala Lys Glu lie Ala Asn Met Cys Arg Phe Tyr Glu Met 

115 120 125 

Lys Pro Asp Arg Asp Val Asn Leu Thr His Gin Leu Asn Pro Lvs Val 

136 135 140 

Lys Ser Phe Ser Gin Phe He Ser Glu Asn Gin Gly Ala Phe Lys Glv 
145 150 155 i 6 o 

Met 



<210> 424 
<211> 489 
<212> DNA 

<213> Homo sapiens 
<400> 424 

atgcagcatc accaccatca ccaccacact 
tttctcgcgg cgtggcggcc cgtgaaagcc 
ccgatgggag atgtaccaat ggatggtatc 
agcattttta attctccaga ggaattttta 
ctaacaatac agcaatatgc tgatgttttg 



agtgtccgcg tggcggccta ctttgaaaac 60 
tctgatggag attactacac cttggctgta 120 
tctgttgctg atattggagc agccgtctct 180 
ggcaaggccg tggggctcag tgcagaagca 240 
tccaaggctt tggggaaaga agtccgagat 300 
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gcaaagatta ccccggaagc tttcgagaag ctgggattcc ctgcagcaaa ggaaatagcc 360 
aatatgtgtc gtttctatga aatgaagcca gaccgagatg tcaatctcac ccaccaacta 420 
aatcccaaag tcaaaagctt cagccagttt atctcagaga accagggagc cttcaagggc 480 
atgtgatga 489 

<210> 425 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 425 

aacaaactgt atatcggaaa cctcagcgag aa 32 

<210> 426 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 426 

ccatagaatt cattacttcc gtcttgactg agg 33 

<210> 427 
<211> 586 
<212> PRT 

<213> Homo sapiens 



<400> 427 



Met 


Gin 


His 


His 


His 


His 


His His 


Asn 


Lys Leu Tyr He Gly Asn Leu 


1 








5 








10 , 15 


Ser 


Glu 


Asn 


Ala 


Ala 


Pro 


Ser Asp 


Leu 


Glu Ser He Phe Lys Asp Ala 








20 








25 


30 


Lys 


He 


Pro 


Val 


Ser 


Gly 


Pro Phe 


Leu 


Val Lys Thr Gly Tyr Ala Phe 






35 








40 




45 


Val 


Asp 


Cys 


Pro 


Asp 


Glu 


Ser Trp 


Ala 


Leu Lys Ala He Glu Ala Leu 




50 










55 




60 


Ser 


Gly 


Lys 


He 


Glu 


Leu 


His Gly 


Lys 


Pro He Glu Val Glu His Ser 


65 










70 






75 80 


Val 


Pro 


Lys 


Arg 


Gin 


Arg 


He. Arg 


Lys 


Leu Gin He Arg Asn He Pro 










85 








90 95 


Pro 


His 


Leu 


Gin 


Trp 


Glu 


Val Leu 


Asp 


Ser Leu Leu Val Gin Tyr Gly 








100 








105 


110 


Val 


Val 


Glu 


Ser 


Cys 


Glu 


Gin Val 


Asn 


Thr Asp Ser Glu Thr Ala Val 






115 








120 




125 


Val 


Asn 


Val 


Thr 


Tyr 


Ser 


Ser Lys 


Asp 


Gin Ala Arg Gin Ala Leu Asp 




130 










135 




140 , 


Lys 


Leu 


Asn 


Gly 


Phe 


Gin 


Leu Glu 


Asn 


Phe Thr Leu Lys Val Ala Tyr 


145 










150 






155 160 


lie 


Pro 


Asp 


Glu 


Thr 


Ala 


Ala Gin 


Gin 


Asn Pro Leu Gin Gin Pro Arg 










165 








170 175 


Gly 


Arg 


Arg 


Gly 


Leu 


Gly 


Gin Arg 


Gly 


Ser Ser Arg Gin Gly Ser Pro 








180 








185 


190 


Gly 


Ser 


Val 


Ser 


Lys 


Gin 


Lys Pro 


Cys 


Asp Leu Pro Leu Arg Leu Leu 
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Val 

He 
225 
Lys 

Pro 

Lys 

Leu 

Asn 
305 
Pro 

Lys 

Lys 

Ala 

Pro 
385 
Thr 

Leu 

Gin 

Ala 

Gly 
4 65 
He 

Ala 

Gly 

Val 



195 
Pro Thr Gin 
210 

Arg Asn lie 
Glu Asn Ala 



Phe Val Gly 
215 

Thr Lys Gin 

230 
Gly Ala Ala 
245 

Ser Ala Ala 



Glu Gly Thr 
260 

Glu Ala Gin Asp He Lys 

275 
Ala His Asn 
290 

Leu Lys Lys 



Leu Gin Glu 

Gly Asn Val 
340 

He Arg Glu 

355 
His Leu He 
370 

Thr Ser Gly 

Pro Pro Tyr 

Phe He Pro 
420 

His He Lys 

435 
Pro Ala Glu 
450 

Pro Pro Glu 



Asn Phe Val 
.295 

He Glu Gin 

310 
Leu Thr Leu 
325 

Glu Thr Cys 



200 

Ala He He 

Thr Gin Ser 

Glu Lys Ser 
250 

Cys Lys Ser 

265 
Phe Thr Glu 
280 

Gly Arg Leu 



205 

Gly Lys Glu 

220 
Lys He Asp 
235 

He Thr He 
He Leu Glu 



Gly Ala Thr 

Val His Arg 
240 

Leu Ser Thr 

255 
He Met His 
270 

Leu Lys He 



Glu He Pro 
285 

He Gly Lys Glu Gly Arg 
300 

Asp Thr Asp Thr Lys He 
315 

Glu Arg Thr 



Tyr Asn Pro 
330 

Ala Lys Ala Glu Glu Glu 
345 

Ser Tyr Glu Asn Asp He 
360 

Asn Leu Asn 



Pro Gly Leu 
375 

Met Pro Pro 

390 
Pro Gin Phe 
405 

Ala Leu Ser 
Gin Leu Ser 



Pro Thr Ser 

Glu Gin Ser 
410 

Val Gly Ala 

425 
Arg Phe Ala 
440 

Ala Lys Val 



Ala Pro Asp 
455 

Ala Gin Phe Lys Ala Gin 
470 

Lys Glu Glu Asn Phe Val 
485 

Val Pro Ser 



Lys 
545 
Gin 

Gin 



His He Arg 
500 

Gly Lys Thr 

515 
Val Pro Arg 
530 

He Thr Gly 

Glu He Leu 

Ser Gly Pro 
580 



Val Asn Glu 

i 

Asp Gin Thr 
535 

His Phe Tyr 

550 
Thr Gin Val 
565 

Pro Gin Ser 



Ser Pro Lys 
490 

Phe Ala Ala 

505 
Leu Gin Asn 
520 

Pro Asp Glu 



Ala Ser Met 
365 

Ala Leu Gly 

380 
Gly Pro Pro 
395 

Glu Thr Glu 

He He Gly 

Gly Ala Ser 
445 

Arg Met Val 

460 
Gly Arg He 
475 

Glu Glu Val 
Gly Arg Val 



Thr He Ser 
320 

He Thr Val 

335 
He Met Lys 
350 

Asn Leu Gin 

Leu Phe Pro 

Ser Ala Met 
400 

Thr Val His 

415 
Lys Gin Gly 
J 30 

He Lys He 

He He Thr 

Tyr Gly Lys 
480 

Lys Leu Glu 
495 

lie Gly Lys 
510 

Ala Glu Val 



Leu Ser Ser 
525 

Asn Asp Gin Val Val Val 
540 

Ala Cys Gin Val Ala Gin 
555 

Gin Gin Gin 



Lys Gin His 
570 

Arg Arg Lys 
585 



Arg Lys He 
560 

Lys Ala Leu 
575 



<210> 428 1 

<211> 1764 

<212> DNA 

<213> Homo sapiens 

<400> 428 

atgcagcatc accaccatca ccacaacaaa ctgtatatcg gaaacctcag cgagaacgcc 60 
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gccccctcgg acctagaaag tatcttcaag gacgccaaga tcccggtgtc gggacccttc 120 
ctggtgaaga ctggctacgc gttcgtggac tgcccggacg agagctgggc cctcaaggcc 180 
^atcgaggcgc tttcaggtaa aatagaactg cacgggaaac ccatagaagt tgagcactcg 240 
gtcccaaaaa ggcaaaggat tcggaaactt cagatacgaa atatcccgcc tcatttacag 300 
tgggaggtgc tggatagttt actagtccag tatggagtgg tggagagctg tgagcaagtg 360 
aacactgact cggaaactgc agttgtaaat gtaacctatt ccagtaagga ccaagctaga 420 
caagcactag acaaactgaa tggatttcag ttagagaatt tcaccttgaa agtagcctat 480 
atccctgatg aaacggccgc ccagcaaaac cccttgcagc agccccgagg tcgccggggg 540 
cttgggcaga ggggctcctc aaggcagggg tctccaggat ccgtatccaa gcagaaacca 600 
tgtgatttgc ctctgcgcct gctggttccc acccaatttg ttggagccat cataggaaaa 660 
gaaggtgcca ccattcggaa catcaccaaa cagacccagt ctaaaatcga tgtccaccgt 720 
aaagaaaatg cgggggctgc tgagaagtcg attactatcc tctctactcc tgaaggcacc 780 
tctgcggctt gtaagtctat tctggagatt atgcataagg aagctcaaga tataaaattc 840 
acagaagaga tccccttgaa gattttagct cataataact ttgttggacg tcttattggt 900 
aaagaaggaa gaaatcttaa aaaaattgag caagacacag acactaaaat cacgatatct 960 
ccattgcagg aattgacgct gtataatcca gaacgcacta ttacagttaa aggcaatgtt 1020 
gagacatgtg ccaaagctga ggaggagatc atgaagaaaa tcagggagtc ttatgaaaat 1080 
gatattgctt ctatgaatct tcaagcacat ttaattcctg gattaaatct gaacgccttg 1140 
ggtctgttcc cacccacttc agggatgcca cctcccacct cagggccccc ttcagccatg 1200 
actcctccct acccgcagtt tgagcaatca gaaacggaga ctgttcatct gtttatccca 1260 
gctctatcag tcggtgccat catcggcaag cagggccagc acatcaagca gctttctcgc 1320 
tttgctggag cttcaattaa gattgctcca gcggaagcac cagatgctaa agtgaggatg 1380 
gtgattatca ctggaccacc agaggctcag ttcaaggctc agggaagaat ttatggaaaa 1440 
attaaagaag aaaactttgt tagtcctaaa gaagaggtga aacttgaagc tcatatcaga 1500 
gtgccatcct ttgctgctgg cagagttatt ggaaaaggag gcaaaacggt gaatgaactt 1560 
cagaatttgt caagtgcaga agttgttgtc cctcgtgacc agacacctga tgagaatgac 1620 
caagtggttg tcaaaataac tggtcacttc tatgcttgcc aggttgccca gagaaaaatt 1680 
caggaaattc tgactcaggt aaagcagcac caacaacaga aggctctgca aagtggacca 1740 
cctcagtcaa gacggaagta atga 1764 

<210> 429 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 429. 

ccatggaatt cattatttca atataagata atctc 35 

i 

<210> 430 
<211> 881 
<212> PRT 
<213> Homo sapiens 

<400> 430 

Met Gin His His His His His His Gly Val Gin Leu Gin Asp Asn Gly 

15 10 15 

Tyr Asn Gly Leu Leu lie Ala He Asn Pro Gin Val Pro Glu Asn Gin 

20 25 30 f 

Asn Leu He Ser Asn He Lys Glu Met He Thr Glu Ala Ser Phe Tyr 

35 40 45 

Leu Phe Asn Ala Thr Lys Arg Arg Val Phe Phe Arg Asn He Lys He 

50 55 60 

Leu He Pro Ala Thr Trp Lys Ala Asn Asn Asn Ser Lys He Lys Gin 
65 70 75 ^ 80 

Glu Ser Tyr Glu Lys Ala Asn Val He Val Thr Asp Trp Tyr Gly Ala 
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His Gly 

Gly Lys 

Thr Ala 
130 
His Leu 
145 

Tyr He 

He Thr 

Cys He 

Ser Thr 
210 
Ser Val 
225 

Asn Leu 

Thr Asp 

Leu Pro 

Val Cys 
290 
Leu Leu 
305 

Glu He 

He Arg 

Leu Val 

He Cys 
370 
Gly Lys 
385 

Lys Leu 



Asp Asp 
100 
Tyr He 
115 

Gly Tyr 

Arg Tip 

Asn Gly 

Gly He 
180 
He Ser 
195 

Gin Asn 

Val Glu 

Gin Asn 

Ser Ala 
260 
Pro Pro 
275 

Leu Val 

Gin Leu 

His Thr 

Ala Gin 
340 
Ser Tyr 
355 

Ser Gly 



85 

Pro Tyr 

His Phe 

Gly Ser 

Gly Val 
150 
Gin Asn 
165 

Phe Val 

Lys Leu 

Ala Thr 

Phe Cys 
230 
Gin Met 
245 

Asp Phe 

Pro Thr 

Leu Asp 

Gin Gin 
310 
Phe Val 
325 

Leu His 
Leu Pro 
Leu Lys 



Ala Tyr 
Leu Gly 



He His 

Leu Ser 

Asn Ser 
450 
Gly Asp 
465 

Val Lys 

Val Gly 

Pro Glu 

Asn Phe 
530 
Gly Thr 



Ser He 
420 
Arg Leu 
435 

Asn Ser 

He Phe 

Pro His 

Asn Asp 
500 
He He 
515 . 
He Thr 



Gly Ser 
390 
Asn Cys 
405 

Ala Leu 

Thr Gly 

Met He 

Gin Gin 
470 
His Gin 
485 

Thr Met 



Thr Leu 

Thr Pro 
120 
Arg Gly 
135 

Phe Asp 

Gin He 

Cys Glu 

Phe Lys 
200 
Ala Ser 
215 

Asn Ala 

Cys Ser 

His His 

Phe Ser 
280 
Val Ser 
295 

Ala Ala 

Gly He 

Gin He 

Thr Thr 
360 
Lys Gly 
375 

Val Met 



Leu Phe 
Asn Leu 
Ala Lys Pro Gly 



Leu Pro 

Gly Ser 

Gly Leu 
• 440 
Asp Ala 
455 

His He 

Leu Lys 

Phe Leu 

Asp Pro 
520 
Thr Phe 
535 

His Trp 



90 

Gin Tyr Arg 
105 

Asn Phe Leu 

Arg Val Phe 

Glu Tyr Asn 
155 

Lys Val Thr 

170 
Lys Gly Pro 
185 

Glu Gly Cys 

He Met Phe 

Ser Thr His 
235 

Leu Arg Ser 

250 
Ser Phe Pro 
265 

Leu Val Glu 

Ser Lys Met 

Glu Phe Tyr 
315 

Ala Ser Phe 

330 
Asn Ser Asn 
345 

Val Ser Ala 

t 

Phe Glu Val 

He lieu Val 
395 

Thr Val Leu 

410 
Ser Ala Ala 
425 

Lys Phe Phe 



Gly Cys 

Leu Asn 
125 
Val His 
140 

Asn Asp 

Arg Cys 

Cys Pro 

Thr Phe 
205 
Met Gin 
220 

Asn Gin 

Ala Trp 

Met Asn 

Ala Gly 
285 
Ala Glu 
300 

Leu Met 

Asp Ser 

Asp Asp 

Lys Thr 
365 
Val Glu 
380 

Thr Ser 
Ser Ser 
Pro Asn 



Phe Ser Arg 

Gin Leu Glu 
475 

Asn Thr Val 
490 

Val Thr Trp 
505 

Asp Gly Arg 
Arg Thr Ala 
Thr Tyr Thr 



Val Pro 
445 
He Ser 
460 

Ser Thr 

Thr Val 

Gin Ala 

Lys Tyr 
525 
Ser Leu 
540 

Leu Asn 



95 

Gly Lys Glu 
110 

Asp Asn Leu 

Glu Trp Ala 

Lys Pro Phe 
160 

Ser Ser Asp 

175 
Gin Glu Asn 
190 

He Tyr Asn 

Ser Leu Ser 

Glu Ala Pro 
240 

Asp Val He 

255 
Gly Thr Glu 
270 

Asp Lys Val 

Ala Asp Arg 

Gin lie Val 
320 

Lys Gly Glu 

335 
Arg Lys Leu 
350 

Asp He Ser 

Lys Leu Asn 

Gly Asp Asp 
400 

Gly Ser Thr 

415 
Leu Glu Glu 
430 

Asp He Ser 

Ser Gly Thr 

Gly Glu Asn 
480 

Asp Asn Thr 

i495 
Ser Gly Pro 
510 

Tyr Thr Asn 
Trp He Pro 
Asn Thr His 
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545 
His 

Ser 

Leu 

Phe 

Thr 
625 
Asp 

Ala 

Ser 

Val 

Lys 
705 
Val 

His 

Lys 

Phe 

Leu 
785 
Ser 

Ser 

Thr 

Asn 

lie 
865 
Lys 



Ser Leu Gin 

Ala Val Pro 
580 

His Phe Pro 

595 
Tyr Pro lie 
610 

Gly Asp Pro 

Val lie Lys 

Ala Asn Gly 
660 

lie Ser Thr 
675 

Pro Gly Tyr 
690 

Ser Val Gly 

Ser Ser Gly 

Pro Asp Val 
740 

Val Glu Glu 

755 
Asp Gin Gly 
770 

Gin Asn lie 

Lys Arg Asn 

Pro Gin He 
820 

His Glu Ser 

835 
Ser Leu Gin 
850 

Pro Pro Asn 



550 
Ala Leu 
565 

Pro Ala 

His Pro 

Leu Asn 

Val Thr 
630 
Asn Asp 
645 

Arg Tyr 

Pro Ala 

Thr Ala 

Arg Asn 
710 
Gly Ser 
725 

Phe Pro 

Glu Leu 

Gin Ala 

Gin Asp 
790 
Pro Gin 
805 

Ser Thr 

His Arg 

Ser Ala 

Ser Asp 
870 



Lys Val 

Thr Val 

Val Met 
600 
Ala Thr 
615 

Leu Arg 

Gly He 

Ser Leu 

His Ser 
680 
Asn Gly 
695 

Glu Glu 

Phe Ser 

Pro Cys 

Thr Leu 
760 
Thr Ser 
775 

Asp Phe 

Gin Ala 

Asn Gly 

He Tyr 
840 
Val Ser 
855 

Pro Val 



Thr Val 
570 
Glu Ala 
585 

He Tyr 

Val Thr 

Leu Leu 

Tyr Ser 
650 
Lys Val 
665 

He Pro 

Asn He 

Glu Arg 

Val Leu 
730 
Lys He 
745 

Ser Trp 

Tyr Glu 

Asn Asn 

Gly He 
810 
Pro Glu 
825 

Val Ala 
Asn He 
Pro Ala 



555 

Thr Ser 

Phe Val 

Ala Asn 

Ala Thr 
620 
Asp Asp 
635 

Arg Tyr 

His Val 

Gly Ser 

Gin Met 
700 
Lys Trp 
715 

Gly Val 

He Asp 

Thr Ala 

He Arg 
780 
Ala He 
795 

Arg Glu 



Arg Ala 

Glu Arg 
590 
Val Lys 
605 

Val Glu 

Gly Ala 

Phe Phe 

Asn His 
670 
His Ala 
685 

Asn Ala 

Gly Phe 

Pro Ala 

Leu Glu 
750 
Pro Gly 
765 

Met Ser 
Leu Val 
He Phe 



His Gin Pro Asn 
, 830 
lie Arg Ala Met 
845 

Ala Pro 



Ala Gin 
860 
Arg Asp 
875 



Tyr Leu 



560 
Ser Asn 
575 

Asp Ser 

Gin Gly 

Pro Glu 

Gly Ala 
640 
Ser Phe 
655 

Ser Pro 

Met Tyr 

Pro Arg 

Ser Arg 
720 
Gly Pro 
735 

Ala* Val 

Glu Asp 

Lys Ser 

Asn Thr 
800 
Thr Phe 
815 

Gly Glu 

Asp Arg 

Leu Phe 

He Leu 
880 



<210> 431 

<211> 2646 

<212> DNA 

<213> Homo sapiens 

<400> 431 , 

atgcagcatc accaccatca ccacggagta cagcttcaag acaatgggta taatggattg 60 

ctcattgcaa ttaatcctca ggtacctgag aatcagaacc tcatctcaaa cattaaggaa 120 

atgataactg aagcttcatt ttacctattt aatgctacca agagaagagt atttttcaga 180 

aatataaaga ttttaatacc tgccacatgg aaagctaata ataacagcaa aataaaacaa 240 

gaatcatatg aaaaggcaaa tgtcatagtg actgactggt atggggcaca tggagatgat 300 

ccatacaccc tacaatacag agggtgtgga aaagagggaa aatacattca tttcacacct 360 

aatttcctac tgaatgataa cttaacagct ggctacggat cacgaggccg agtgtttgtc 420 
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catgaatggg cccacctccg ttggggtgtg ttcgatgagt ataacaatga caaacctttc 480 
tacataaatg ggcaaaatca aattaaagtg acaaggtgtt catctgacat cacaggcatt 540 
<tttgtgtgtg aaaaaggtcc ttgcccccaa gaaaactgta ttattagtaa gctttttaaa 600 
gaaggatgca cctttatcta caatagcacc caaaatgcaa ctgcatcaat aatgttcatg 660 
caaagtttat cttctgtggt tgaattttgt aatgcaagta cccacaacca agaagcacca 720 
aacctacaga accagatgtg cagcctcaga agtgcatggg atgtaatcac agactctgct 780 
gactttcacc acagctttcc catgaacggg actgagcttc cacctcctcc cacattctcg 840 
cttgtagagg ctggtgacaa agtggtctgt ttagtgctgg atgtgtccag caagatggca 900 
gaggctgaca gactccttca actacaacaa gccgcagaat tttatttgat gcagattgtt 960 
gaaattcata ccttcgtggg cattgccagt ttcgacagca aaggagagat cagagcccag 1020 
ctacaccaaa ttaacagcaa tgatgatcga aagttgctgg tttcatatct gcccaccact 1080 
gtatcagcta aaacagacat cagcatttgt tcagggctta agaaaggatt tgaggtggtt 1140 
gaaaaactga atggaaaagc ttatggctct gtgatgatat tagtgaccag cggagatgat 1200 
aagcttcttg gcaattgctt acccactgtg ctcagcagtg gttcaacaat tcactccatt 1260 
gccctgggtt catctgcagc cccaaatctg gaggaattat cacgtcttac aggaggttta 1320 
aagttctttg ttccagatat atcaaactcc aatagcatga ttgatgcttt cagtagaatt 1380 
tcctctggaa ctggagacat tttccagcaa catattcagc ttgaaagtac aggtgaaaat 1440 
gtcaaacctc accatcaatt gaaaaacaca gtgactgtgg ataatactgt gggcaacgac 1500 
actatgtttc tagttacgtg gcaggccagt ggtcctcctg agattatatt atttgatcct 1560 
gatggacgaa aatactacac aaataatttt atcaccaatc taacttttcg gacagctagt 1620 
ctttggattc caggaacagc taagcctggg cactggactt acaccctgaa caatacccat 1680 
cattctctgc aagccctgaa agtgacagtg acctctcgcg cctccaactc agctgtgccc 1740 
ccagccactg tggaagcctt tgtggaaaga gacagcctcc attttcctca tcctgtgatg 1800 
atttatgcca atgtgaaaca gggattttat cccattctta atgccactgt cactgccaca 1860 
gttgagccag agactggaga tectgttacg ctgagactcc ttgatgatgg agcaggtgct 1920 
gatgttataa aaaatgatgg aatttactcg aggtattttt tctcctttgc tgcaaatggt 1980 
agatatagct tgaaagtgca tgtcaatcac tctcccagca taagcacccc agcccactct 2040 
attccaggga gtcatgctat gtatgtacca ggttacacag caaacggtaa tattcagatg 2100 
aatgctccaa ggaaatcagt aggcagaaat gaggaggagc gaaagtgggg ctttagccga 2160 
gtcagctcag gaggctcctt ttcagtgctg ggagttccag ctggccccca ccctgatgtg 2220 
tttccaccat gcaaaattat tgacctggaa gctgtaaaag^ tagaagagga attgacccta 2280 
tcttggacag cacctggaga agactttgat cagggccagg ctacaagcta tgaaataaga 2340 
atgagtaaaa gtctacagaa tatccaagat gactttaaca atgctatttt agtaaataca 2400 
tcaaagcgaa atcctcagca agctggcatc agggagatat ttacgttctc accccaaatt 2460 
tccacgaatg gacctgaaca tcagccaaat ggagaaacac atgaaagcca cagaatttat 2520 
gttgcaatac gagcaatgga taggaactcc ttacagtctg ctgtatctaa cattgcccag 2580 
gcgcctctgt ttattccccc caattctgat cctgtacctg ccagagatta. tcttatattg 2640 
aaataa 2646 

<210> 432 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PGR primer 
<400> 432 

cgcctgctcg agtcattaat attcatcaga aaatgg 35 
<210> 433 

<2U> 371 ? 
<212> PRT 

<213> Homo sapiens 
<400> 433 

Met Gin His His His .His His His Trp Gin Pro Leu Phe Phe Lys Trp 
1 5 10 15 
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Leu 


Leu 


Ser 


Cys Cys 


Pro Gly Ser Ser Gin lie 


Ala Ala Ala Ala Ser 








20 


25 


30 


Thr 


Gin 


Pro 


Glu Asp 


Asp He Asn Thr Gin Arg 


Lys Lys Ser Gin Glu 






35 




40 


45 


Lys 


Met 


Arg 


Glu Val 


Thr Asp Ser Pro Gly Arg 


Pro Arg Glu Leu Thr 




50 






55 


60 


He 


Pro 


Gin 


Thr Ser 


Ser His Gly Ala Asn Arg 


Phe Val Pro Lys Ser 


65 








70 * 75 


80 


Lys 


Ala 


Leu 


Glu Ala 


Val Lys Leu Ala He Glu 


Ala Gly Phe His His 








85 


90 


95 


He 


Asp 


Ser 


Ala His 


Val Tyr Asn Asn Glu Glu 


Gin Val Gly Leu Ala 








100 


105 


110 


He 


Arg 


Ser 


Lys He 


Ala Asp Gly Ser Val Lys 


Arg Glu Asp He Phe 






115 




120 


125 


Tyr 


Thr 


Ser 


Lys Leu 


Trp Ser Asn Ser His Arg 


Pro Glu Leu Val Arg 




130 






135 


140 


Pro 


Ala 


Leu 


Glu Arg 


Ser Leu Lys Asn Leu Gin 


Leu Asp Tyr Val Asp 


145 








150 155 


160 


Leu 


Tyr 


Leu 


He His 


Phe Pro Val Ser Val Lys 


Pro Gly Glu Glu Val 








. 165 


170 


175 


He 


Pro 


Lys 


Asp Glu 


Asn Gly Lys He Leu Phe 


Asp Thr Val Asp Leu 








180 


185 


190 


Cys 


Ala 


Thr 


Trp Glu 


Ala Met Glu Lys Cys Lys 


Asp Ala Gly Leu Ala 






195 




200 


205 


Lys 


Ser 


He 


Gly Val 


Ser Asn Phe Asn His Arg 


Leu Leu Glu Met He 




210 






215 


220 


Leu 


Asn 


Lys 


Pro Gly 


Leu Lys Tyr Lys Pro Val 


Cys Asn Gin Val Glu 


225 








230 235 


240 


Cys 


His 


Pro 


Tyr Phe 


Asn Gin Arg Lys Leu Leu 


Asp Phe Cys Lys Ser 




Asp 




245 


250 


255 


Lys 


He 


Val Leu 


Val Ala Tyr Ser Ala Leu 


Gly Ser His Arg Glu 








260 


265 


270 


Glu 


Pro 


Trp 


Val Asp 


Pro Asn Ser Pro Val Leu 


Leu Glu Asp Pro Val 






275 




280 


285 


Leu 


Cys 


Ala 


Leu Ala 


Lys Lys His Lys Arg Thr 


Pro Ala Leu He Ala 




290 






295 ' 


300 


Leu 


Arg 


Tyr 


Gin Leu 


Gin Arg Gly Val Val Val 


Leu Ala Lys Ser Tyr 


305 








310 315 


320 


Asn 


Glu 


Gin 


Arg He 


Arg Gin Asn Val Gin Val 


Phe Glu Phe Gin Leu 








325 


330 


335 


Thr 


Ser 


Glu 


Glu Met 


Lys Ala He Asp Gly Leu 


Asn Arg Asn Val Arg 








340 


345 


350 


Tyr 


Leu 


Thr 


Leu Asp 


He Phe Ala Gly Pro Pro 


Asn Tyr Pro Phe Ser 






355 




360 


365 


Asp 


Glu 


Tyr 










370 











<210> 434 
<211> 1119 
<212> DNA 

<213> Homo sapiens 1 
<400> 434 

atgcagcatc accaccatca ccactggcag cccctcttct tcaagtggct cttgtcctgt 60 
tgccctggga gttctcaaat tgctgcagca gcctccaccc agcctgagga tgacatcaat 120 
acacagagga agaagagtca ggaaaagatg agagaagtta cagactctcc tgggcgaccc 180 
cgagagctta ccattcctca gacttcttca catggtgcta acagatttgt tcctaaaagt 240 
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aaagctctag aggccgtcaa attggcaata gaagccgggt tccaccatat tgattctgca 300 

catgtttaca ataatgagga gcaggttgga ctggccatcc gaagcaagat tgcagatggc 360 

agtgtgaaga gagaagacat attctacact tcaaagcttt ggagcaattc ccatcgacca 420 

gagttggtcc gaccagcctt ggaaaggtca ctgaaaaatc ttcaattgga ctatgttgac 480 

ctctatctta ttcattttcc agtgtctgta* aagccaggtg aggaagtgat cccaaaagat 540 

gaaaatggaa aaatactatt tgacacagtg gatctctgtg ccacatggga ggccatggag 600 

aagtgtaaag atgcaggatt ggccaagtcc atcggggtgt ccaacttcaa ccacaggctg 660 

ctggagatga tcctcaacaa gccagggctc aagtacaagc ctgtctgcaa ccaggtggaa 720 

tgtcatcctt acttcaacca gagaaaactg ctggatttct gcaagtcaaa agacattgtt 780 

ctggttgcct atagtgctct gggatcccat cgagaagaac catgggtgga cccgaactcc 840 

ccggtgctct tggaggaccc agtcctttgt gccttggcaa aaaagcacaa gcgaacccca 900 

gccctgattg ccctgcgcta ccagctgcag cgtggggttg tggtcctggc caagagctac 960 

aatgagcagc gcatcagaca gaacgtgcag gtgtttgaat tccagttgac ttcagaggag 1020 

atgaaagcca tagatggcct aaacagaaat gtgcgatatt tgacccttga tatttttgct 1080 

ggccccccta attatccatt ttctgatgaa tattaatga 1X19 

<210> 435 
<211> 36 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Primer 
<400> 435 

ggatccgccg ccaccatgac atccattcga gctgta 

<210> 436 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



36 



<220> 

<223> Primer 
<400> 436 

gtcgactcag ctggaccaca gccgcag 

<210> 437 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 437 

ggatccgccg ccaccatgga ctcctggacc ttctgct 

<210> 438 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Primer 
<400> 438 
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gtcgactcag aaatcctttc tcttgac 27 

V210> 439 
<211> 933 
<212> DNA 
<213> Homo sapiens 

<400> 439 

atggactcct ggaccttctg ctgtgtgtcc ctttgcatcc tggtagcaaa gcacacagat 60 

gctggagtta tccagtcacc ccggcacgag gtgacagaga tgggacaaga agtgactctg 120 

agatgtaaac caatttcagg acacgactac cttttctggt acagacagac catgatgcgg 180 

ggactggagt tgctcattta ctttaacaac aacgttccga tagatgattc agggatgccc 240 

gaggatcgat tctcagctaa gatgcctaat gcatcattct ccactctgaa gatccagccc 300 

tcagaaccca gggactcagc tgtgtacttc tgtgccagca gtttagttgg agcaaacact 360 

gaagctttct ttggacaagg caccagactc acagttgtag aggacctgaa caaggtgttc 420 

ccacccgagg tcgctgtgtt tgagccatca gaagcagaga tctcccacac ccaaaaggcc 480 

acactggtgt gcctggccac aggcttcttc cctgaccacg tggagctgag ctggtgggtg 540 

aatgggaagg aggtgcacag tggggtcagc acggacccgc agcccctcaa ggagcagccc 600 

gccctcaatg actccagata ctgcctgagc agccgcctga gggtctcggc caccttctgg 660 

cagaaccccc gcaaccactt ccgctgtcaa gtccagttct acgggctctc ggagaatgac 720 

gagtggaccc aggatagggc caaacccgtc acccagatcg tcagcgccga ggcctggggt 780 

agagcagact gtggctttac ctcggtgtcc taccagcaag gggtcctgtc tgccaccatc 840 

ctctatgaga tcctgctagg gaaggccacc ctgtatgctg tgctggtcag cgcccttgtg 900 

ttgatggcca tggtcaagag aaaggatttc tga 933 

<210> 440 
<211> 822 
<212> DNA 
<213> Homo sapiens 

<400> 440 

atgacatcca ttcgagctgt atttatattc ctgtggctgc agctggactt ggtgaatgga 60 

gagaatgtgg agcagcatcc ttcaaccctg agtgtccagg agggagacag cgctgttatc 120 

aagtgtactt attcagacag tgcctcaaac tacttccctt ggtataagca agaacttgga 180 

aaaagacctc agcttattat agacattcgt tcaaatgtgg gcgaaaagaa agaccaacga 240 

attgctgtta cattgaacaa gacagccaaa catttctccc tgcacatcac agagacccaa 300 

cctgaagact cggctgtcta cttctgtgca gcaagtatac tgaacaccgg taaccagttc 360 

tattttggga cagggacaag tttgacggtc attccaaata tccagaaccc tgaccctgcc 420 

gtgtaccagc tgagagactc taaatccagt gacaagtctg tctgcctatt caccgatttt 480 

gattctcaaa caaatgtgtc acaaagtaag gattctgatg tgtatatcac agacaaaact 540 

.gtgctagaca tgaggtctat ggacttcaag agcaacagtg ctgtggcctg gagcaacaaa 600 

tctgactttg catgtgcaaa cgccttcaac aacagcatta ttccagaaga caccttcttc 660 

cccagcccag aaagttcctg tgatgtcaag ctggtcgaga aaagctttga aacagatacg 720 

aacctaaact ttcaaaacct gtcagtgatt gggttccgaa tcctcctcct gaaagtggcc 780 

gggtttaatc tgctcatgac gctgcggctg tggtccagct ga 822 

<210> 441 
<211> 2311 
<212> DNA 

<213> Homo sapiens t 
<400> 441 

gatttaatcc tatgacaaac taagttggtt ctgtcttcac ctgttttggt gaggttgtgt 60 
aagagttggt gtttgctcag gaagagattt aagcatgctt gcttacccag actcagagaa 120 
gtctccctgt tctgtcctag ctatgttcct gtgttgtgtg cattcgtctt ttccagagca 180 
aaccgcccag agtagaagat ggattggggc acgctgcaga cgatcctggg gggtgtgaac 240 
aaacactcca ccagcattgg aaagatctgg ctcaccgtcc tcttcatttt tcgcattatg 300 
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atcctcgttg tggctgcaaa ggaggtgtgg ggagatgagc aggccgactt tgtctgcaac 360 
accctgcagc caggctgcaa gaacgtgtgc tacgatcact acttccccat ctcccacatc 420 
<cggctatggg ccctgcagct gatcttcgtg tccagcccag cgctcctagt ggccatgcac 480 
gtggcctacc ggagacatga gaagaagagg aagttcatca agggggagat aaagagtgaa 540 
tttaaggaca tcgaggagat caaaacccag aaggtccgca tcgaaggctc cctgtggtgg 600 
acctacacaa gcagcatctt cttccgggtc atcttcgaag ccgccttcat gtacgtcttc 660 
tatgtcatgt acgacggctt ctccatgcag cggctggtga agtgcaacgc ctggccttgt 720 
cccaacactg tggactgctt tgtgtcccgg cccacggaga agactgtctt cacagtgttc 780 
atgattgcag tgtctggaat ttgcatcctg ctgaatgtca ctgaattgtg ttatttgcta 840 
attagatatt gttctgggaa gtcaaaaaag ccagtttaac gcattgccca gttgttagat 900 
taagaaatag acagcatgag agggatgagg caacccgtgc tcagctgtca aggctcagtc 960 
gccagcattt cccaacacaa agattctgac cttaaatgca accatttgaa acccctgtag 1020 
gcctcaggtg aaactccaga tgccacaatg agctctgctc ccctaaagcc tcaaaacaaa 1080 
ggcctaattc tatgcctgtc ttaattttct ttcacttaag ttagttccac tgagacccca 1140 
ggctgttagg ggttattggt gtaaggtact ttcatatttt aaacagagga tatcggcatt 1200 
tgtttctttc tctgaggaca agagaaaaaa gccaggttcc acagaggaca cagagaaggt 1260 
ttgggtgtcc tcctggggtt ctttttgcca actttcccca cgttaaaggt gaacattggt 1320 
tctttcattt gctttggaag ttttaatctc taacagtgga caaagttacc agtgccttaa 1380 
actctgttac actttttgga agtgaaaact ttgtagtatg ataggttatt ttgatgtaaa 1440 
gatgttctgg ataccattat atgttccccc tgtttcagag gctcagattg taatatgtaa 1500 
atggtatgtc attcgctact atgatttaat ttgaaatatg gtcttttggt tatgaatact 1560 
ttgcagcaca gctgagagag gctgtctgtt gtattcattg tggtcatagc acctaacaac 1620 
attgtagcct caatcgagtg agacagacta gaagttccta gttggcttat gatagcaaat 1680 
ggcctcatgt caaatattag atgtaatttt gtgtaagaaa tacagactgg atgtaccacc 1740 
aactactacc tgtaatgaca ggcctgtcca acacatctcc cttttccatg ctgtggtagc 1800 
cagcatcgga aagaacgctg at'ttaaagag gtgagcttgg gaattttatt gacacagtac 1860 
catttaatgg ggagacaaaa atgggggcca ggggagggag aagtttctgt cgttaaaaac 1920 
gagtttggaa agactggact ctaaattctg ttgattaaag atgagctttg tctaccttca 1980 
aaagtttgtt tggcttaccc ccttcagcct ccaatttttt aagtgaaaat ataactaata 2040 
acatgtgaaa agaatagaag ctaaggttta gataaatatt gagcagatct ataggaagat 2100 
tgaacctgaa tattgccatt atgcttgaca tggtttccaa aaaatggtac tccacatact 2160 
tcagtgaggg taagtatttt cctgttgtca agaatagcat tgtaaaagca ttttgtaata 2220 
ataaagaata gctttaatga tatgcttgta actaaaataa ttttgtaatg tatcaaatac 2280 
atttaaaaca ttaaaatata atctctataa t 2311 

<210> 442 i 

<211> 226 

<212> PRT 

<213> Homo sapiens 

<400> 442 

Met Asp Trp Gly Thr Leu Gin Thr He Leu Gly Gly Val Asn Lys His 
5 10 is 

Ser Thr Ser He Gly Lys He Trp Leu Thr Val Leu Phe He Phe Arg 
20 25 30 

He Met He Leu Val Val Ala Ala Lys Glu Val Trp Gly Asp Glu Gin 
35 40 45 

Ala Asp Phe Val Cys Asn Thr Leu Gin Pro Gly Cys Lys Asn Val Cys 
50 55 60 ' 

Tyr Asp His Tyr Phe Pro He Ser His He Arg Leu Trp Ala Leu Gin 



65 70 75 



80 



Leu He Phe Val Ser Ser Pro Ala Leu Leu Val Ala Met His Val Ala 
85 90 95 
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Tyr Arg Arg His Glu Lys Lys Arg Lys Phe He Lys Gly Glu He Lys 
< 100 105 110 

Ser Glu Phe Lys Asp He Glu Glu He Lys Thr Gin Lys Val Arg He 
115 120 125 

Glu Gly Ser Leu Trp Trp Thr Tyr Thr Ser Ser He Phe Phe Arg Val 
130 135 140 

He Phe Glu Ala Ala Phe Met Tyr Val Phe Tyr Val Met Tyr Asp Gly 
145 150 155 160 

Phe Ser Met Gin Arg Leu Val Lys Cys Asn Ala Trp Pro Cys Pro Asn 
165 170 175 

Thr Val Asp Cys Phe Val Ser Arg Pro Thr Glu Lys Thr Val Phe Thr 
180 185 190 

Val Phe Met He Ala Val Ser Gly lie Cys He Leu Leu Asn Val Thr 
195 200 205 

Glu Leu Cys Tyr Leu Leu He Arg Tyr Cys Ser Gly Lys Ser Lys Lys 
210 215 220 

Pro Val 
225 



<210> 443 
<211> 23 
<212> PRT 

<213> Homo sapiens 
<400> 443 

Val Lys Leu Cys Gly He Asp Pro Cys Pro Asn Leu Val Asp Cys Phe 
5 10 15 

He Ser Arg Pro Gly Cys Gly 
20 



<210> 444 
<211> 36 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 

<400> 444 1 

caatcaggca tgcacaacaa actgtatatc ggaaac 36 



<210> 445 
<211> 30 
<212> DNA 
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<213> Artificial Sequence 

* <220> 
<223> PCR primer 

<400> 445 

cgtcaagatc ttcattactt ccgtcttgac 

<210> 446 
<211> 579 
<212> PRT 

<213> Homo sapiens 
<400> 446 

Met Asn Lys Leu Tyr He Gly Asn Leu Ser Glu Asn Ala Ala Pro Ser 
5 10 15 

Asp Leu Glu Ser He Phe Lys Asp Ala Lys He Pro Val Ser Gly Pro 
20 25 30 

Phe Leu Val Lys Thr Gly Tyr Ala Phe Val Asp Cys Pro Asp Glu Ser 
35 40 45 

Trp Ala Leu Lys Ala He Glu Ala Leu Ser Gly Lys He Glu Leu His 
50 55 60 

Gly Lys Pro He Glu Val Glu His Ser Val Pro Lys Arg Gin Arg He 
65 70 75 ao 

Arg Lys Leu Gin He Arg Asn He Pro Pro His Leu Gin Trp Glu Val 
85 90 95 

Leu Asp Ser Leu Leu Val Gin Tyr Gly Val Val Glu Ser Cys Glu Gin 
100 105 no 

Val Asn Thr Asp Ser Glu Thr Ala Val Val Asn Val Thr Tyr Ser Ser 
115 120 125 

Lys Asp Gin Ala Arg Gin Ala Leu Asp Lys Leu Asn Gly Phe Gin Leu 
130 135 140 

Glu Asn Phe Thr Leu Lys Val Ala Tyr He Pro Asp Glu Thr Ala Ala 
145 I 50 155 i 60 

Gin Gin Asn Pro Leu Gin Gin Pro Arg Gly Arg Arg Gly Leu Gly Gin 
165 170 175 

Arg Gly Ser Ser Arg Gin Gly Ser Pro Gly Ser Val Ser Lys Gin Lys 
180 185 190 

Pro Cys Asp Leu Pro Leu Arg Leu Leu Val Pro Thr Gin Phe Val Gly 
195 200 205 

Ala He He Gly Lys Glu Gly Ala Thr He Arg Asn He Thr Lys Gin 
210 215 220 

Thr Gin Ser Lys He Asp Val His Arg Lys Glu Asn Ala Gly Ala Ala 
225 230 235 240 



30 
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Glu Lys Ser lie Thr lie Leu Ser Thr Pro Glu Gly Thr Ser Ala Ala 
245 250 255 

Cys Lys Ser lie Leu Glu lie Met His Lys Glu Ala Gin Asp He Lys 
260 265 270 

Phe Thr Glu Glu He Pro Leu Lys He Leu Ala His Asn Asn Phe Val 
275 280 285 

Gly Arg Leu He Gly Lys Glu Gly Arg Asn Leu Lys Lys He Glu Gin 
290 295 300 

Asp Thr Asp Thr Lys He Thr He Ser Pro Leu Gin Glu Leu Thr Leu 
305 310 315 320 

Tyr Asn Pro Glu Arg Thr lie. Thr Val Lys Gly Asn Val Glu Thr Cys 
325 330 335 

Ala Lys Ala Glu Glu Glu He Met Lys Lys lie Arg Glu Ser Tyr Glu 
340 345 350 

Asn Asp He Ala Ser Met Asn Leu Gin Ala His Leu He Pro Gly Leu 
355 360 365 

Asn Leu Asn Ala Leu Gly Leu Phe Pro Pro Thr Ser Gly Met Pro Pro 
370 375 380 

Pro Thr Ser Gly Pro Pro Ser Ala Met Thr Pro Pro Tyr Pro Gin Phe 
385 390 395 400 

Glu Gin Ser Glu Thr Glu Thr Val His Leu Phe He Pro Ala Leu Ser 
405 410 415 

Val Gly Ala He He Gly Lys Gin Gly Gin His He Lys Gin Leu Ser 
420 425 , 430 

Arg Phe Ala Gly Ala Ser He Lys He Ala Pro Ala Glu Ala Pro Asp 
435 440 445 

i 

Ala Lys Val Arg Met Val He He Thr Gly Pro Pro Glu Ala Gin Phe 
450 455 460 

Lys Ala Gin Gly Arg He Tyr Gly Lys He Lys Glu Glu Asn Phe Val 
465 470 475 480 

Ser Pro Lys Glu Glu Val Lys Leu Glu Ala His He Arg Val Pro Ser 
485 490 495 

Phe Ala Ala Gly Arg Val He Gly Lys Gly Gly Lys Thr Val Asn Glu 
500 505 510 

i 

Leu Gin Asn Leu Ser Ser Ala Glu Val Val Val Pro Arg Asp Gin Thr 
515 520 525 

Pro Asp Glu Asn Asp Gin Val Val Val Lys He Thr Gly His Phe Tyr 
530 535 540 



Ala Cys Gin Val Ala Gin Arg Lys He Gin Glu He Leu Thr Gin Val 
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545 



550 



555 



560 



^Lys Gin His Gin Gin Gin Lys Ala Leu Gin Ser Gly Pro Pro Gin Ser 
565 570 575 



Arg Arg Lys 



' <210> 447 
<211> 1743 
<212> DNA 
<213> Homo sapiens 

<400> 447 

atgaacaaac tgtatatcgg aaacctcagc gagaacgccg ccccctcgga cctagaaagt 60 
atcttcaagg acgccaagat cccggtgtcg ggacccttcc tggtgaagac tggctacgcg 120 
ttcgtggact gcccggacga gagctgggcc ctcaaggcca tcgaggcgct ttcaggtaaa 180 
atagaactgc acgggaaacc catagaagtt gagcactcgg tcccaaaaag gcaaaggatt 240 
cggaaacttc agatacgaaa tatcccgcct catttacagt gggaggtgct ggatagttta 300 
ctagtccagt atggagtggt ggagagctgt gagcaagtga acactgactc ggaaactgca 360 
gttgtaaatg taacctattc cagtaaggac caagctagac aagcactaga caaactgaat 420 
ggatttcagt tagagaattt caccttgaaa gtagcctata tccctgatga aacggccgcc 480 
cagcaaaacc ccttgcagca gccccgaggt cgccgggggc ttgggcagag gggctcctca 540 
aggcaggggt ctccaggatc cgtatccaag cagaaaccat gtgatttgcc tctgcgcctg 600 
ctggttccca cccaatttgt tggagccatc ataggaaaag aaggtgccac cattcggaac 660 
atcaccaaac agacccagtc taaaatcgat gtccaccgta aagaaaatgc gggggctgct 720 
gagaagtcga ttactatcct ctctactcct gaaggcacct ctgcggcttg taagtctatt 780 
ctggagatta tgcataagga agctcaagat ataaaattca cagaagagat ccccttgaag 840 
attttagctc ataataactt tgttggacgt cttattggta aagaaggaag aaatcttaaa 900 
aaaattgagc aagacacaga cactaaaatc acgatatctc cattgcagga attgacgctg 960 
tataatccag aacgcactat tacagttaaa ggcaatgttg agacatgtgc caaagctgag 1020 
gaggagatca tgaagaaaat cagggagtct tatgaaaatg atattgcttc tatgaatctt 1080 
caagcacatt taattcctgg attaaatctg aacgccttgg gtctgttccc acccacttca 1140 
gggatgccac ctcccacctc agggccccct tcagccatga ctcctcccta cccgcagttt 1200 
gagcaatcag aaacggagac tgttcatctg tttatcccag ctctatcagt cggtgccatc 1260 
atcggcaagc agggccagca catcaagcag ctttctcgct ttgctggagc ttcaattaag 1320 
attgctccag cggaagcacc agatgctaaa gtgaggatgg tgattatcac tggaccacca 1380 
gaggctcagt tcaaggctca gggaagaatt tatggaaaaa ttaaagaaga aaactttgtt 1440 
agtcctaaag aagaggtgaa acttgaagct catatcagag tgccatcctt tgctgctggc 1500 
agagttattg gaaaaggagg caaaacggtg aatgaacttc agaatttgtc aagtgcagaa 1560 
gttgttgtcc ctcgtgacca gacacctgat gagaatgacc aagtggttgt caaaataact 1620 
ggtcacttct atgcttgcca ggttgcccag agaaaaattc aggaaattct gactcaggta 1680 
aagcagcacc aacaacagaa ggctctgcaa agtggaccac ctcagtcaag acggaagtaa 1740 
ga 1743 



<210> 448 
<211> 35 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> PCR primer 
<400> 448 

cgtactagca tatgaacaaa ctgtatatcg gaaac 
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<210> 449 
'<211> 579 
<212> PRT 
<213> Homo sapiens 

<400> 449 

Met Asn Lys Leu Tyr lie Gly Asn Leu Ser Glu Asn Ala Ala Pro Ser 
5 10 15 

Asp Leu Glu Ser lie Phe Lys Asp Ala Lys lie Pro Val Ser Gly Pro 
20 25 30 

Phe Leu Val Lys Thr Gly Tyr Ala Phe Val Asp Cys Pro Asp Glu Ser 
35 40 45 

Trp Ala Leu Lys Ala lie Glu Ala Leu Ser Gly Lys lie Glu' Leu His 
50 55 60 

Gly Lys Pro lie Glu Val Glu His Ser Val Pro Lys Arg Gin Arg lie 
65 70 75 80 

Arg Lys Leu Gin lie Arg Asn lie Pro Pro His Leu Gin Trp Glu Val 
85 90 95 

Leu Asp Ser Leu Leu Val Gin Tyr Gly Val Val Glu Ser Cys Glu Gin 
100 105 110 

Val Asn Thr Asp Ser Glu Thr Ala Val Val Asn Val Thr Tyr Ser Ser 
115 120 125 

Lys Asp Gin Ala Arg Gin Ala Leu Asp Lys Leu Asn Gly Phe Gin Leu 
130 135 140 

Glu Asn Phe Thr Leu Lys Val Ala Tyr lie Prp Asp Glu Thr Ala Ala 
145 150 155 160 

Gin Gin Asn Pro Leu Gin Gin Pro Arg Gly Arg Arg Gly Leu Gly Gin 
165 170 175 

Arg Gly Ser Ser Arg Gin Gly Ser Pro Gly Ser Val Ser Lys Gin Lys 
180 ^ 185 190 

Pro Cys Asp Leu Pro Leu Arg Leu Leu Val Pro Thr Gin Phe Val Gly 
195 200 205 

Ala lie He Gly Lys Glu Gly Ala Thr He Arg Asn He Thr Lys Gin 

210 215 220 

Thr Gin Ser Lys lie Asp Val His Arg Lys Glu Asn Ala Gly Ala Ala 
225 230 235 240 

1 

Glu Lys Ser He Thr He Leu Ser Thr Pro Glu Gly Thr Ser Ala Ala 
245 250 255 

Cys Lys Ser He Leu Glu He Met His Lys Glu Ala Gin Asp He Lys 
260 265 270 

Phe Thr Glu Glu He Pro Leu Lys He Leu Ala His Asn Asn Phe Val 
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275 



280 



285 



' Gly Arg Leu He Gly Lys Glu Gly Arg Asn Leu Lys Lys He Glu Gin 
290 295 300 

Asp Thr Asp Thr Lys He Thr He Ser Pro Leu Gin Glu Leu Thr Leu 
305 310 315 320 

Tyr Asn Pro Glu Arg Thr He Thr Val Lys Gly Asn Val Glu Thr Cys 
325 330 335 

Ala Lys Ala Glu Glu Glu He Met Lys Lys lie Arg Glu Ser Tyr Glu 
340 345 350 

Asn Asp He Ala Ser Met Asn Leu Gin Ala His Leu He Pro Gly Leu 
355 360 365 

Asn Leu Asn Ala Leu Gly Leu Phe Pro Pro Thr Ser Gly Met Pro Pro 
370 375 380 

Pro Thr Ser Gly Pro Pro Ser Ala Met Thr Pro Pro Tyr Pro Gin Phe 
385 390 395 400 

Glu Gin Ser Glu Thr Glu Thr Val His Leu Phe lie Pro Ala Leu Ser 
405 410 415 

Val Gly Ala He He Gly Lys Gin Gly Gin His He Lys Gin Leu Ser 
420 425 430 

Arg Phe Ala Gly Ala Ser He Lys He Ala Pro Ala Glu Ala Pro Asp 
435 440 445 

Ala Lys Val Arg Met Val He He Thr Gly Pro Pro Glu Ala Gin Phe 

450 455 460 

Lys Ala Gin Gly Arg He Tyr Gly Lys He Lys Glu Glu Asn Phe Val 

465 470 475 480 

Ser Pro Lys Glu Glu Val Lys Leu Glu Ala His He Arg Val Pro Ser 
485 490 495 

Phe Ala Ala Gly Arg Val He Gly Lys Gly Gly Lys Thr Val Asn Glu 
500 505 510 

Leu Gin Asn Leu Ser Ser Ala Glu Val Val Val Pro Arg Asp Gin Thr 
515 520 525 

Pro Asp Glu Asn Asp Gin Val Val Val Lys He Thr Gly His Phe Tyr 
530 535 540 

Ala Cys Gin Val Ala Gin Arg Lys He Gin Glu He Leu Thr Gin Val 
545 550 555 , 560 

Lys Gin His Gin Gin Gin Lys Ala Leu Gin Ser Gly Pro Pro Gin Ser 
565 570 575 

Arg Arg Lys 
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<210> 450 

<211> 1743 

; <212> DNA 

<213> Homo sapiens 

<400> 450 

atgaacaaac tgtatatcgg aaacctcagc 
atcttcaagg acgccaagat cccggtgtcg 
ttcgtggact gcccggacga gagctgggcc 
atagaactgc acgggaaacc catagaagtt 
cggaaacttc agatacgaaa tatcccgcct 
ctagtccagt atggagtggt ggagagctgt 
gttgtaaatg taacctattc cagtaaggac 
ggatttcagt tagagaattt caccttgaaa 
cagcaaaacc ccttgcagca gccccgaggt 
aggcaggggt ctccaggatc cgtatccaag 
ctggttccca cccaatttgt tggagccatc 
atcaccaaac agacccagtc taaaatcgat 
gagaagtcga ttactatcct ctctactcct 
ctggagatta tgcataagga agctcaagat 
attttagctc ataataactt tgttggacgt 
aaaattgagc aagacacaga cactaaaatc 
tataatccag aacgcactat tacagttaaa 
gaggagatca tgaagaaaat cagggagtct 
caagcacatt taattcctgg attaaatctg 
gggatgccac ctcccacctc agggccccct 
gagcaatcag aaacggagac tgttcatctg 
atcggcaagc agggccagca catcaagcag 
attgctccag cggaagcacc agatgctaaa 
gaggctcagt tcaaggctca gggaagaatt 
agtcctaaag aagaggtgaa acttgaagct 
agagttattg gaaaaggagg caaaacggtg 
gttgttgtcc ctcgtgacca gacacctgat 
ggtcacttct atgcttgcca ggttgcccag 
aagcagcacc aacaacagaa ggctctgcaa 
tga 



gagaacgccg ccccctcgga cctagaaagt 60 
ggacccttcc tggtgaagac tggctacgcg 120 
ctcaaggcca tcgaggcgct ttcaggtaaa 180 
gagcactcgg tcccaaaaag gcaaaggatt 240 
catttacagt gggaggtgct ggatagttta 300 
gagcaagtga acactgactc ggaaactgca 360 
caagctagac aagcactaga caaactgaat 420 
gtagcctata tccctgatga aacggccgcc 480 
cgccgggggc ttgggcagag gggctcctca 540 
cagaaaccat gtgatttgcc tctgcgcctg 600 
ataggaaaag aaggtgccac cattcggaac 660 
gtccaccgta aagaaaatgc gggggctgct 720 
gaaggcacct ctgcggcttg taagtctatt 780 
ataaaattca cagaagagat ccccttgaag 840 
cttattggta aagaaggaag aaatcttaaa 900 
acgatatctc cattgcagga attgacgctg 960 
ggcaatgttg agacatgtgc caaagctgag 1020 
tatgaaaatg atattgcttc tatgaatctt 1080 
aacgccttgg gtctgttccc acccacttca 1140 
tcagccatga ctcctcccta cccgcagttt 1200 
tttatcccag ctctatcagt cggtgccatc 1260 
ctttctcgct ttgctggagc ttcaattaag 1320 
gtgaggatgg tgattatcac tggaccacca 1380 
tatggaaaaa ttaaagaaga aaactttgtt 1440 
catatcagag tgccatcctt tgctgctggc 1500 
aatgaacttc agaatttgtc aagtgcagaa 1560 
gagaatgacc aagtggttgt caaaataact 1620 
agaaaaattc aggaaattct gactcaggta 1680 
agtggaccac ctcagtcaag acggaagtaa 1740 

17*43 



<210> 451 
<211> 25 
<212> PRT 

<213> Homo sapiens 
<400> 451 

Leu Gly Lys Glu Val Arg Asp Ala Lys lie Thr Pro Glu Ala Phe Glu 
5 10 is 

Lys Leu Gly Phe Pro Ala Ala Lys Glu 
20 25 



<210> 452 
<211> 25 
<212> PRT 

<213> Homo sapiens 



<400> 452 
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Lys Ala Ser Asp Gly Asp Tyr Tyr Thr Leu Ala Val Pro Met Gly Asp 
5 10 i 5 

Val Pro Met Asp Gly He Ser Val Ala 
20 25 



<210> 453 
<211> 16 
' <212> PRT 

<213> Homo sapiens 

<400> 453 

Pro Asp Arg Asp Val Asn Leu Thr His Gin Leu Asn Pro Lys Val Lys 
5 10 is 

<210> 454 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 454 

Lys He Ala Pro Ala Glu Ala Pro Asp Ala Lys Val Arg Met Val He 
5 10 15 

He Thr Gly Pro 
20 



<210> 455 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 455 

Pro Asp Glu Thr Ala Ala Gin Gin Asn Pro Leu Gin Gin Pro Arg Gly 
5 10 15 

Arg Arg Gly Leu 
20 



<210> 456 
<211> 20 
<212> PRT 

<213> Homo sapiens 
<400> 456 

Arg Thr He Thr Val Lys Gly Asn Val Glu Thr Cys Ala Lys Ala Glu 
5 10 ,15 

Glu Glu He Met 
20 

<210> 457 
<211> 20 
<212> PRT 
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<213> Homo sapiens 
'<400> 457 

Ala Phe Val Asp Cys Pro Asp Glu Ser Trp Ala Leu Lys Ala lie Glu 
5 10 15 

Ala Leu Ser Gly 
20 



<210> 458 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 458 

lie Arg Lys Leu Gin lie Arg Asn lie Pro Pro His Leu Gin Trp Glu 
5 10 15 

Val Leu Asp Ser 
20 



<210> 459 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 459 

Ala Gin Gin Asn Pro Leu Gin Gin Pro Arg Gly Arg Arg Gly Leu Gly 
5 10 15 

Gin Arg Gly Ser 
20 



<210> 460 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 460 

Asp Val His Arg Lys Glu Asn Ala Gly Ala Ala Glu Lys Ser lie Thr 
5 10 15 

lie Leu Ser Thr 
20 



<210> 461 

<211> 20 

<212> PRT 

<213> Homo sapiens 



<400> 461 

Leu Tyr Asn Pro Glu Arg Thr lie Thr Val Lys Gly Asn Val Glu Thr 
5 10 15 



WO 02/00174 



POYUS01/21065 



197 



Cys Ala Lys Ala 
20 



<210> 462 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 462 

Glu Glu Glu lie Met Lys Lys lie Arg Glu Ser Tyr Glu Asn Asp lie 
5 10 15 

Ala Ser Met Asn 
20 



<210> 463 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 463 

Leu Asn Ala Leu Gly Leu Phe Pro Pro Thr Ser Gly Met Pro Pro Pro 
5 10 15 

Thr Ser Gly Pro 
20 



<210> 464 

<211> 20 

<212> PRT 

<213> Homo sapiens 

<400> 464 1 

Lys lie Ala Pro Ala Glu Ala Pro Asp Ala Lys Val Arg Met V<al He 
5 10 15 

He Thr Gly Pro 
20 

<210> 465 

<211> 18 

<212> PRT 

<213> Homo sapiens 

<400> 465 

Thr Gly Tyr Ala Phe Val Asp Cys Pro Asp Glu Ser Trp Ala Leu Lys He 
5 10 15 

Glu 



<210> 466 

<211> H 

<212> PRT 

<213> Homo sapiens 
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<400> 466 

Phe Val Asp Cys Pro Asp Glu Ser Trp Ala Leu 
* 5 10 



<210> 467 

<211> 33 

<212> DNA 

<213> Homo sapiens 



<400> 467 

ttcgtggact gcccggacga gagctgggcc etc 



33 



